PREFATORY NOTE.


The present volume contains all the published papers of Professor
J. Willard Gibbs except those upon Thermodynamics, which are 
placed in Volume I. of this collection. 

In a few cases slight corrections had been made by the author
in his own copies of the papers. These changes, together with the
correction of obvious misprints in the originals, have been
incorporated in the present edition without comment. Where for the
sake of clearness it has seemed desirable to insert a word or two
in a footnote or in the text itself, the addition has been indicated
by enclosing it within square brackets [ ], a sign which is otherwise
used only in the formulæ.

Formation of a new Indeterminate Formula of Motion by the
Substitution of the Variations of the Components of Acceleration for
the Variations of the Coordinates in the usual Formula.


The laws of motion are frequently expressed by an equation of the
form

$$\sum\left[(X - m\ddot{x})\,\delta x + (Y - m\ddot{y})\,\delta y + (Z - m\ddot{z})\,\delta z\right] = 0, \tag{1}$$

in which
$m$ denotes the mass of a particle of the system considered,
$x, y, z$ its rectangular coordinates,
$\ddot{x}, \ddot{y}, \ddot{z}$ the second differential coefficients of the coordinates with
respect to the time,
$X, Y, Z$ the components of the forces acting on the particle,
$\delta x, \delta y, \delta z$ any arbitrary variations of the coordinates which are
simultaneously possible, and
$\Sigma$ a summation with respect to all the particles of the system.


It is evident that we may substitute for $\delta x, \delta y, \delta z$ any other
expressions which are capable of the same and only of the same sets
of simultaneous values.

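Now if the nature of the system is such that certain functions
$A$, $B$, etc. of the coordinates must be constant, or given functions of
the time, we have

$$\begin{aligned}
\sum\left(\frac{dA}{dx}\,\delta x + \frac{dA}{dy}\,\delta y + \frac{dA}{dz}\,\delta z\right) &= 0, \\
\sum\left(\frac{dB}{dx}\,\delta x + \frac{dB}{dy}\,\delta y + \frac{dB}{dz}\,\delta z\right) &= 0, \\
&\text{etc.}
\end{aligned} \tag{2}$$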


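These are the equations of condition, to which the variations in
the general equation of motion (1) are subject. But if $A$ is constant
or a determined function of the time, the same must be true of
$\dot{A}$ and $\ddot{A}$. Now

$$\dot{A} = \sum\left(\frac{dA}{dx}\,\dot{x} + \frac{dA}{dy}\,\dot{y} + \frac{dA}{dz}\,\dot{z}\right)$$

and

$$\ddot{A} = \sum\left(\frac{dA}{dx}\,\ddot{x} + \frac{dA}{dy}\,\ddot{y} + \frac{dA}{dz}\,\ddot{z}\right) + H,$$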


ON THE FUNDAMENTAL FORMULÆ OF DYNAMICS.


where $H$ represents terms containing only the second differential
coefficients of $A$ with respect to the coordinates, and the first
differential coefficients of the coordinates with respect to the time.
Therefore, if we conceive of a variation affecting the accelerations of
the particles at the time considered, but not their positions or
velocities, we have

$$\delta\ddot{A} = \sum\left(\frac{dA}{dx}\,\delta\ddot{x} + \frac{dA}{dy}\,\delta\ddot{y} + \frac{dA}{dz}\,\delta\ddot{z}\right) = 0,$$

and, in like manner,

$$\delta\ddot{B} = \sum\left(\frac{dB}{dx}\,\delta\ddot{x} + \frac{dB}{dy}\,\delta\ddot{y} + \frac{dB}{dz}\,\delta\ddot{z}\right) = 0, \quad \text{etc.} \tag{3}$$


Comparing these equations with (2), we see that when the
accelerations of the particles are regarded as subject to the variation
denoted by $\delta$, but not their positions or velocities, the possible
values of $\delta\ddot{x}, \delta\ddot{y}, \delta\ddot{z}$ are subject to precisely the same restrictions
as the values of $\delta x, \delta y, \delta z$, when the positions of the particles are
regarded as variable. We may, therefore, write for the general equation
of motion

$$\sum\left[(X - m\ddot{x})\,\delta\ddot{x} + (Y - m\ddot{y})\,\delta\ddot{y} + (Z - m\ddot{z})\,\delta\ddot{z}\right] = 0, \tag{4}$$

regarding the positions and velocities of the particles as unaffected by
the variation denoted by $\delta$, a condition which may be expressed by
the equations

$$\delta x = 0, \quad \delta y = 0, \quad \delta z = 0, \qquad \delta\dot{x} = 0, \quad \delta\dot{y} = 0, \quad \delta\dot{z} = 0. \tag{5}$$


We have so far supposed that the conditions which restrict the
possible motions of the systems may be expressed by equations
between the coordinates alone or the coordinates and the time. To
extend the formula of motion to cases in which the conditions are
expressed by the characters $\leq$ or $\geq$, we may write

$$\sum\left[(X - m\ddot{x})\,\delta\ddot{x} + (Y - m\ddot{y})\,\delta\ddot{y} + (Z - m\ddot{z})\,\delta\ddot{z}\right] \leq 0. \tag{6}$$

The conditions which determine the possible values of $\delta\ddot{x}, \delta\ddot{y}, \delta\ddot{z}$
will not, in such cases, be entirely similar to those which determine
the possible values of $\delta x, \delta y, \delta z$, when the coordinates are regarded as
variable. Nevertheless, the laws of motion are correctly expressed
by the formula (6), while the formula

$$\sum\left[(X - m\ddot{x})\,\delta x + (Y - m\ddot{y})\,\delta y + (Z - m\ddot{z})\,\delta z\right] \leq 0, \tag{7}$$

does not, as naturally interpreted, give so complete and accurate an
expression of the laws of motion.

This may be illustrated by a simple example. 
Let it be required to find the acceleration of a material point,
which, at a given instant, is moving with given velocity on the
frictionless surface of a body (which it cannot penetrate, but which
it may leave), and is acted on by given forces. For simplicity, we
may suppose that the normal to the surface, drawn outward from the
moving point at the moment considered, is parallel to the axis of $X$
and in the positive direction. The only restriction on the values of
$\delta x, \delta y, \delta z$ is that

$$\delta x \geq 0.$$

Formula (7) will therefore give

$$\ddot{x} \geq \frac{X}{m}, \qquad \ddot{y} = \frac{Y}{m}, \qquad \ddot{z} = \frac{Z}{m}.$$

The condition that the point shall not penetrate the body gives
another condition for the value of $\ddot{x}$. If the point remains upon the
surface, $\ddot{x}$ must have a certain value $N$, determined by the form of
the surface and the velocity of the point. If the value of $\ddot{x}$ is less
than this, the point must penetrate the body. Therefore,

$$\ddot{x} \geq N.$$

But this does not suffice to determine the acceleration of the point.

Let us now apply formula (6) to the same problem. Since $\ddot{x}$ cannot
be less than $N$,

$$\text{if } \ddot{x} = N, \qquad \delta\ddot{x} \geq 0.$$

This is the only restriction on the value of $\delta\ddot{x}$, for if $\ddot{x} > N$, the
value of $\delta\ddot{x}$ is entirely arbitrary. Formula (6), therefore, requires
that

$$\text{if } \ddot{x} = N, \quad \ddot{x} \geq \frac{X}{m}; \qquad \text{but if } \ddot{x} > N, \quad \ddot{x} = \frac{X}{m};$$

that is (since $\ddot{x}$ cannot be less than $N$), that $\ddot{x}$ shall be equal to the
greater of the quantities $N$ and $\dfrac{X}{m}$, or to both, if they are equal,
and that

$$\ddot{y} = \frac{Y}{m}, \qquad \ddot{z} = \frac{Z}{m}.$$

The values of $\ddot{x}, \ddot{y}, \ddot{z}$ are therefore entirely determined by this
formula in connection with the conditions afforded by the constraints
of the system.*
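
The rule just obtained from formula (6) may be sketched numerically.
The Python below is an illustration only, under stated assumptions: the
function names, argument names and sample numbers are hypothetical and
are not part of the paper; the outward normal is taken along the
positive x axis, as in the example, and the value N is supplied by the
caller rather than computed from the surface.

```python
def acceleration_on_surface(force, mass, n_surface):
    """Acceleration of a point on a frictionless surface (normal along +x).

    force     -- (X, Y, Z), components of the given force
    mass      -- m, mass of the point
    n_surface -- N, the value x'' takes if the point stays on the surface
    """
    X, Y, Z = force
    # Formula (6): x'' is the greater of N and X/m; y'' and z'' are free.
    return (max(n_surface, X / mass), Y / mass, Z / mass)


def satisfies_formula_6(x_acc, X, mass, n_surface, tol=1e-12):
    """Check (X - m x'') * dx'' <= 0 for every admissible variation dx''."""
    if x_acc < n_surface - tol:
        return False                   # the point would penetrate the body
    residual = X - mass * x_acc
    if x_acc > n_surface + tol:
        return abs(residual) <= tol    # dx'' is arbitrary: the factor must vanish
    return residual <= tol             # only dx'' >= 0: the factor must be <= 0


# Hypothetical sample values: a force pressing the point against the surface.
pressed = acceleration_on_surface((-2.0, 1.0, 4.0), 2.0, 0.0)
# Hypothetical sample values: a force strong enough to lift the point off.
lifted = acceleration_on_surface((6.0, 0.0, 0.0), 2.0, 1.0)
```

The checker rejects a value of x'' exceeding both N and X/m, which is
the value that formula (7), as naturally interpreted, fails to exclude.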

*The failure of the formula (7) in this case is rather apparent than real; for,
although the formula apparently allows to $\ddot{x}$, at the instant considered, a value
exceeding both $N$ and $\dfrac{X}{m}$, it does not allow this for any interval, however short. For
if $\ddot{x} > N$, the point will immediately leave the surface, and then the formula requires
that $\ddot{x} = \dfrac{X}{m}$.

The following considerations will show that what is true in this
case is also true in general, when the conditions to which the system
is subject are such that certain functions of the coordinates cannot
exceed certain limits, either constant or variable with the time. If
certain values of $\delta\ddot{x}, \delta\ddot{y}, \delta\ddot{z}$ (with unvaried values of $x, y, z$, and
$\dot{x}, \dot{y}, \dot{z}$) are simultaneously possible at a given instant, equal or
proportional values with the same signs must be possible for
$\delta x, \delta y, \delta z$ immediately after the instant considered, and must satisfy
formula (1), and therefore (6), in connection with the values of
$\ddot{x}, \ddot{y}, \ddot{z}, X, Y, Z$ immediately after that instant. The values of
$\ddot{x}, \ddot{y}, \ddot{z}$, thus determined, are of course the very quantities which we
wish to obtain, since the acceleration of a point at a given instant
does not denote anything different from its acceleration immediately
after that instant.

For an example of a somewhat different class of cases, we may
suppose that in a system, otherwise free, $\dot{x}$ cannot have a negative
value. Such a condition does not seem to affect the possible values
of $\delta x$, as naturally interpreted in a dynamical problem. Yet, if we
should regard the value of $\delta x$ in (7) as arbitrary, we should obtain

$$\ddot{x} = \frac{X}{m},$$

which might be erroneous. But if we regard $\delta x$ as expressing a
velocity of which the system, if at rest, would be capable (which is
not a natural signification of the expression), we should have
$\delta x \geq 0$, which, with (7), gives

$$\ddot{x} \geq \frac{X}{m}.$$

This is not incorrect, but it leaves the acceleration undetermined.
If we should regard $\delta x$ as denoting such a variation of the velocity
as is possible for the system when it has its given velocity (this also
is not a natural signification of the expression), formula (7) would
give the correct value of $\ddot{x}$ except when $\dot{x} = 0$. In this case (which
cannot be regarded as exceptional in a problem of this kind), we
should have $\delta x \geq 0$, which will leave $\ddot{x}$ undetermined, as before.

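The application of formula (6), in problems of this kind, presents
no difficulty. From the condition

$$\dot{x} \geq 0,$$

we obtain, first,

$$\text{if } \dot{x} = 0, \qquad \ddot{x} \geq 0,$$

then,

$$\text{if } \dot{x} = 0 \text{ and } \ddot{x} = 0, \qquad \delta\ddot{x} \geq 0,$$

which is the only limitation on the value of $\delta\ddot{x}$. With this condition
we deduce from (6) that either

$$\dot{x} = 0, \quad \ddot{x} = 0, \quad \text{and} \quad \ddot{x} \geq \frac{X}{m},$$

or

$$\ddot{x} = \frac{X}{m}.$$

That is, if $\dot{x} = 0$, $\ddot{x}$ has the greater of the values $\dfrac{X}{m}$ and $0$;
otherwise, $\ddot{x} = \dfrac{X}{m}$.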

In cases of this kind also, in which the function which cannot 
exceed a certain value involves the velocities (with or without the 
coordinates), one may easily convince himself that formula (6) is 
always valid, and always sufficient to determine the accelerations 
with the aid of the conditions afforded by the constraints of the 
system. 
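
For the one-sided velocity condition treated in the example above, the
result of formula (6) may be sketched as a small Python function. This
is only an illustration: the function name, the argument names and the
sample values in the test are hypothetical, and the exact comparison of
the velocity with zero is an assumption that a numerical integrator
would have to replace by a tolerance.

```python
def acceleration_with_one_sided_velocity(X, mass, x_dot):
    """x'' for a particle, otherwise free, whose velocity x' cannot be negative."""
    if x_dot < 0:
        raise ValueError("the state violates the condition x' >= 0")
    if x_dot == 0:
        # The condition is operative: x'' is the greater of X/m and 0.
        return max(X / mass, 0.0)
    # The condition is not operative at this instant: x'' = X/m.
    return X / mass
```

With the particle at rest and a force directed backward, the function
returns zero; with any positive velocity it returns X/m, whatever the
sign of the force.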

But instead of examining such cases in detail, we shall proceed to 
consider the subject from a more general point of view. 


Comparison of the New Formula with the Statical Principle of 
Virtual Velocities—Case of Discontinuous Changes of Velocity. 


Formula (1) has so far served as a point of departure. The general 
validity of this, the received form of the indeterminate equation of 
motion, being assumed, it has been shown that formula (6) will be 
valid and sufficient, even in cases in which both (1) and (7) fail. We 
now proceed to show that the statical principle of virtual velocities, 
when its real signification is carefully considered, leads directly to 
formula (6), or to an analogous formula for the determination of the 
discontinuous changes of velocity, when such occur. This will be the
case even if we start with the usual analytical expression of the
principle

$$\sum\left(X\,\delta x + Y\,\delta y + Z\,\delta z\right) \leq 0, \tag{8}$$

to which, at first sight, formula (6) appears less closely related than
(7). For the variations of the coordinates in this formula must be
regarded as relating to differences between the configuration which
the system has at a certain time, and which it will continue to have
in case of equilibrium, and some other configuration which the system
might be supposed to have at some subsequent time. These temporal
relations are not indicated explicitly in the notation, and should not
be, since the statical problem does not involve the time in any
quantitative manner. But in a dynamical problem, in which we take
account of the time, it is hardly natural to use $\delta x, \delta y, \delta z$ in the same
sense. In any problem in which $x, y, z$ are regarded as functions
of the time, $\delta x, \delta y, \delta z$ are naturally understood to relate to differences
between the configuration which the system has at a certain time,
and some other configuration which it might (conceivably) have had
at that time instead of that which it actually had.

Now when we suppose a point to have a certain position, specified
by $x, y, z$, at a certain time, its position at that time is no longer a
subject of hypothesis or of question. It is its future positions which
form the subject of inquiry. Its position in the immediate future
is naturally specified by

$$x + \dot{x}\,dt + \tfrac{1}{2}\ddot{x}\,dt^2 + \text{etc.}, \qquad y + \dot{y}\,dt + \tfrac{1}{2}\ddot{y}\,dt^2 + \text{etc.}, \qquad z + \dot{z}\,dt + \tfrac{1}{2}\ddot{z}\,dt^2 + \text{etc.},$$

and we may regard the variations of these expressions as
corresponding to the $\delta x, \delta y, \delta z$ of the statical problem. It is evidently
sufficient to take account of the first term of these expressions of
which the variation is not zero. Now, $x, y, z$, as has already been
said, are to be regarded as constant. With respect to the terms
containing $\dot{x}, \dot{y}, \dot{z}$, two cases are to be distinguished, according as
there is, or is not, a finite change of velocity at the instant considered.

Let us first consider the most important case, in which there is no
discontinuous change of velocity. In this case, $\dot{x}, \dot{y}, \dot{z}$ are not to be
regarded as variable (by $\delta$), and the variations of the above
expressions are represented by

$$\tfrac{1}{2}\,\delta\ddot{x}\,dt^2, \qquad \tfrac{1}{2}\,\delta\ddot{y}\,dt^2, \qquad \tfrac{1}{2}\,\delta\ddot{z}\,dt^2,$$

which are, therefore, to be substituted for $\delta x, \delta y, \delta z$ in the general
formula of equilibrium (8) to adapt it to the conditions of a dynamical
problem. By this substitution (in which the common factor $\tfrac{1}{2}\,dt^2$
may of course be omitted), and the addition of the terms expressing
the reaction against acceleration, we obtain formula (6).

But if the circumstances are such that there is (or may be) a
discontinuity in the values of $\dot{x}, \dot{y}, \dot{z}$ at the instant considered, it is
necessary to distinguish the values of these expressions before and
after the abrupt change. For this purpose, we may apply $\dot{x}, \dot{y}, \dot{z}$
to the original values, and denote the changed values by $\dot{x} + \Delta\dot{x}$,
$\dot{y} + \Delta\dot{y}$, $\dot{z} + \Delta\dot{z}$. The value of $x$ at a time very shortly subsequent
to the instant considered will be expressed by $x + (\dot{x} + \Delta\dot{x})\,dt + \text{etc.}$, in
which we may regard $\Delta\dot{x}$ as subject to the variation denoted by $\delta$.
The variation of the expression is therefore $\delta\Delta\dot{x}\,dt$. Instead of $-m\ddot{x}$,
which expresses the reaction against acceleration, we need in the
present case $-m\,\Delta\dot{x}$ to express the reaction against the abrupt change
of velocity. A reaction against such a change of velocity is, of
course, to be regarded as infinite in intensity in comparison with
reactions due to acceleration, and ordinary forces (such as cause
acceleration) may be neglected in comparison. If, however, we
conceive of the system as acted on by impulsive forces (i.e., such
as have no finite duration, but are capable of producing finite changes
of velocity, and are measured numerically by the discontinuities of
velocity which they produce in the unit of mass), these forces should
be combined with the reactions due to the discontinuities of velocity
in the general formula which determines these discontinuities. If
the impulsive forces are specified by $X, Y, Z$, the formula will be

$$\sum\left[(X - m\,\Delta\dot{x})\,\delta\Delta\dot{x} + (Y - m\,\Delta\dot{y})\,\delta\Delta\dot{y} + (Z - m\,\Delta\dot{z})\,\delta\Delta\dot{z}\right] \leq 0. \tag{9}$$

The reader will remark the strict analogy between this formula
and (6), which would perhaps be more clearly exhibited if we should
write $\dfrac{d\dot{x}}{dt}, \dfrac{d\dot{y}}{dt}, \dfrac{d\dot{z}}{dt}$ for $\ddot{x}, \ddot{y}, \ddot{z}$ in that formula.

But these formulæ may be established in a much more direct
manner. For the formula (8), although for many purposes the most
convenient expression of the principle of virtual velocities, is by no
means the most convenient for our present purpose. As the usual
name of the principle implies, it holds true of velocities as well
as of displacements, and is perhaps more simple and more evident
when thus applied.*

*Even in Statics, the principle of virtual velocities, as distinguished from that of
virtual displacements, has a certain advantage in respect of its evidence. The
demonstration of the principle in the first section of the Mécanique Analytique, if velocities
had been considered instead of displacements, would not have been exposed to an
objection, which has been expressed by M. Bertrand in the following words: "On a
objecté, avec raison, à cette assertion de Lagrange l'exemple d'un point pesant en
équilibre au sommet le plus élevé d'une courbe; il est évident qu'un déplacement
infiniment petit le ferait descendre, et, pourtant, ce déplacement ne se produit pas."
[The example of a heavy point in equilibrium at the highest summit of a curve has
rightly been raised as an objection to this assertion of Lagrange; it is evident that an
infinitely small displacement would make it descend, and yet this displacement does
not occur.] (Mécanique Analytique, troisième édition, tome 1, page 22, note de
M. Bertrand.) The value of $z$ (the height of the point above a horizontal plane) can
certainly be diminished by a displacement of the point, but the value of $z$ is not
affected by any velocity given to the point.

The real difficulty in the consideration of displacements is that they are only possible
at a time subsequent to that in which the system has the configuration to which the
question of equilibrium relates. We may make the interval of time infinitely short,
but it will always be difficult, in the establishing of fundamental principles, to treat a
conception of this kind (relating to what is possible after an infinitesimal interval of
time) with the same rigor as the idea of velocities or accelerations, which, in the cases
to which (9) and (6) respectively relate, we may regard as communicated immediately to
the system.

If we wish to apply the principle, thus understood, to a moving
system so as to determine whether certain changes of velocity
specified by $\Delta\dot{x}, \Delta\dot{y}, \Delta\dot{z}$ are those which the system will really receive
at a given instant, the velocities to be multiplied into the forces and
reactions in the most simple application of the principle are manifestly
such as may be imagined to be compounded with the assumed
velocities, and are therefore properly specified by $\delta\Delta\dot{x}, \delta\Delta\dot{y}, \delta\Delta\dot{z}$.
The formula (9) may therefore be regarded as the most direct
application of the principle of virtual velocities to discontinuous changes
of velocity in a moving system.

In the case of a system in which there are no discontinuous changes
of velocity, but which is subject to forces tending to produce
accelerations, when we wish to determine whether certain accelerations,
specified by $\ddot{x}, \ddot{y}, \ddot{z}$, are such as the system will really receive, it is
evidently necessary to consider whether any possible variation of
these accelerations is favored more than it is opposed by the forces
and reactions of the system. The formula (6) expresses a criterion
of this kind in the most simple and direct manner. If we regard
a force as a tendency to increase a quantity expressed by $x$, the
product of the force by $\delta\ddot{x}$ is the natural measure of the extent to
which this tendency is satisfied by an arbitrary variation of the
accelerations. The principle expressed by the formula may not be
very accurately designated by the words virtual velocities, but it
certainly does not differ from the principle of virtual velocities (in
the stricter sense of the term), more than this differs from that of
virtual displacements, a difference so slight that the distinction of
the names is rarely insisted upon, and that it is often very difficult
to tell which form of the principle is especially intended, even when
the principle is enunciated or discussed somewhat at length.

But, although the formulæ (6) and (9) differ so little from the
ordinary formulæ, they not only have a marked advantage in respect
of precision and accuracy, but also may be more satisfactory to the
mind, in that the changes considered (to which $\delta$ relates), are not so
violently opposed to all the possibilities of the case as are those which
are represented by the variations of the coordinates.* Moreover, as
we shall see, they naturally lead to various important laws of motion.


Transformation of the New Formula. 


Let us now consider some of the transformations of which our
general formula (6) is capable. If we separate the terms containing
the masses of the particles from those which contain the forces, we
have

$$\sum\left(X\,\delta\ddot{x} + Y\,\delta\ddot{y} + Z\,\delta\ddot{z}\right) - \sum\left[\tfrac{1}{2}m\,\delta\left(\ddot{x}^2 + \ddot{y}^2 + \ddot{z}^2\right)\right] \leq 0, \tag{10}$$

or, if we write $u$ for the acceleration of a particle,

$$\sum\left(X\,\delta\ddot{x} + Y\,\delta\ddot{y} + Z\,\delta\ddot{z}\right) - \delta\sum\left(\tfrac{1}{2}mu^2\right) \leq 0. \tag{11}$$

*It may have seemed to some readers of the Mécanique Analytique (a work of which
the unity of method is one of the most striking characteristics, and that to which its
universally recognized artistic merit is in great measure due) that the treatment of
dynamical problems in that work is not entirely analogous to the treatment of statical
problems. The statical question, whether a system will remain in equilibrium in a
given configuration, is determined by Lagrange by considering all possible motions of
the system and inquiring whether there is any reason why the system should take any
one of them. A similar method in dynamics would be based upon a comparison of a
proposed motion with all other motions of which the system is capable without violating
its kinematical conditions. Instead of this, Lagrange virtually reduces the dynamical
problem to a statical one, and considers, not the possible variations of the proposed
motion, but the motions which would be possible if the system were at rest. This
reduction of a given problem to a simpler one, which has already been solved, is a
method which has its advantages, but it is not the characteristic method of the
Mécanique Analytique. That which most distinguishes the plan of this treatise from
the usual type is the direct application of the general principle to each particular case.

The point is perhaps of small moment, and may be differently regarded by others,
but it is mentioned here because it was a feeling of this kind (whether justified or not)
and the desire to express the formula of motion by means of a maximum or minimum
condition, in which the conditions under which the maximum or minimum subsists
should be such as the problem naturally affords (Gauss's principle of least constraint
being at the time unknown to the present writer, and the conditions under which the
minimum subsists in the principle of least action being such that that is hardly
satisfactory as a fundamental principle), which led to the formulæ proposed in this paper.


If, instead of terms of the form $X\,\delta x$, or in addition to such terms,
equation (1) had contained terms of the form $P\,\delta p$, in which $p$ denotes
any quantity determined by the configuration of the system, it is
evident that these would give terms of the form $P\,\delta\ddot{p}$ in (6), (10) and
(11). For the considerations which justified the substitution of
$\delta\ddot{x}, \delta\ddot{y}, \delta\ddot{z}$ for $\delta x, \delta y, \delta z$ in the usual formula were in no respect dependent
upon the fact that $x, y, z$ denote rectangular coordinates, but would
apply equally to any other quantities which are determined by the
configuration of the system.

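Hence, if the moments of all the forces of the system are represented
by the sum $\sum(P\,\delta p)$, the general formula of motion may be written

$$\sum\left(P\,\delta\ddot{p}\right) - \delta\sum\left(\tfrac{1}{2}mu^2\right) \leq 0. \tag{12}$$

If the forces admit of a force-function $V$, we have

$$\delta\ddot{V} - \delta\sum\left(\tfrac{1}{2}mu^2\right) \leq 0,$$

or

$$\delta\left[\ddot{V} - \sum\left(\tfrac{1}{2}mu^2\right)\right] \leq 0. \tag{13}$$

But if the forces are determined in any way whatever by the
configuration and velocities of the system, with or without the time,
$X, Y, Z$ and $P$ will be unaffected by the variation denoted by $\delta$, and
we may write the formula of motion in the form

$$\delta\sum\left(X\ddot{x} + Y\ddot{y} + Z\ddot{z} - \tfrac{1}{2}mu^2\right) \leq 0, \tag{14}$$

or

$$\delta\left[\sum\left(P\ddot{p}\right) - \sum\left(\tfrac{1}{2}mu^2\right)\right] \leq 0. \tag{15}$$

If the forces are determined by the configuration alone, or the
configuration and the time, $\delta\dot{X} = 0$, $\delta\dot{Y} = 0$, $\delta\dot{Z} = 0$, $\delta\dot{P} = 0$, and the
general formula may be written

$$\delta\left[\frac{d}{dt}\sum\left(X\dot{x} + Y\dot{y} + Z\dot{z}\right) - \sum\left(\tfrac{1}{2}mu^2\right)\right] \leq 0, \tag{16}$$

or

$$\delta\left[\frac{d}{dt}\sum\left(P\dot{p}\right) - \sum\left(\tfrac{1}{2}mu^2\right)\right] \leq 0. \tag{17}$$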


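The quantity affected by $\delta$ in any one of the last five formulæ
has not only a maximum value, but absolutely the greatest value
consistent with the constraints of the system. This may be shown
in reference to (15) by giving to $\ddot{p}, \ddot{x}, \ddot{y}, \ddot{z}$, contained explicitly
or implicitly in the expression affected by $\delta$, any possible finite
increments $\ddot{p}', \ddot{x}', \ddot{y}', \ddot{z}'$, and subtracting the original value of the
expression from the value thus modified. Now,

$$\begin{aligned}
&\sum\left[P(\ddot{p} + \ddot{p}')\right] - \sum\left[\tfrac{1}{2}m\left\{(\ddot{x} + \ddot{x}')^2 + (\ddot{y} + \ddot{y}')^2 + (\ddot{z} + \ddot{z}')^2\right\}\right] - \sum\left(P\ddot{p}\right) + \sum\left[\tfrac{1}{2}m\left(\ddot{x}^2 + \ddot{y}^2 + \ddot{z}^2\right)\right] \\
&\qquad = \sum\left(P\ddot{p}'\right) - \sum\left[m\left(\ddot{x}\ddot{x}' + \ddot{y}\ddot{y}' + \ddot{z}\ddot{z}'\right)\right] - \sum\left[\tfrac{1}{2}m\left(\ddot{x}'^2 + \ddot{y}'^2 + \ddot{z}'^2\right)\right].
\end{aligned}$$

But since $\ddot{p}', \ddot{x}', \ddot{y}', \ddot{z}'$ are proportional to and of the same sign with
possible values of $\delta\ddot{p}, \delta\ddot{x}, \delta\ddot{y}, \delta\ddot{z}$, we have, by the general formula of
motion,

$$\sum\left(P\ddot{p}'\right) - \sum\left[m\left(\ddot{x}\ddot{x}' + \ddot{y}\ddot{y}' + \ddot{z}\ddot{z}'\right)\right] \leq 0.$$

The second member of the preceding equation is therefore negative.
The first member is therefore negative, which proves the proposition
with respect to (15). The demonstration is precisely the same with
respect to (13) and (14), which may be regarded as particular
cases of (15).

To show the same with regard to (16) and (17), we have only to
observe that the quantities affected by $\delta$ in these formulæ differ from
those affected by the same symbol in (14) and (15) only by the terms

$$\sum\left(\dot{X}\dot{x} + \dot{Y}\dot{y} + \dot{Z}\dot{z}\right) \quad \text{and} \quad \sum\left(\dot{P}\dot{p}\right),$$

which will not be affected by any change in the accelerations of
the system.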

When the forces are determined by the configuration (with or 
without the time), the principle may be enunciated as follows: The 
accelerations in the system are always such that the acceleration of 
the rate of work done by the forces diminished by one-half the sum 
of the products of the masses of the particles by the squares of their 
accelerations has the greatest possible value. 
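
The greatest-value property may be checked numerically in the simplest
case. The following Python sketch is an illustration under stated
assumptions, not part of the paper: it treats a single particle, keeps
only the part of the quantity that depends on the accelerations (the
form under the variation in formula (14)), and searches a finite grid
of admissible accelerations by brute force. The function names, the
grid and the sample values are hypothetical.

```python
def work_rate_measure(force, mass, acc):
    """X x'' + Y y'' + Z z'' - (1/2) m u^2 for one particle."""
    return sum(f * a for f, a in zip(force, acc)) - 0.5 * mass * sum(a * a for a in acc)


def greatest_admissible(force, mass, candidates):
    """Return the candidate acceleration giving the greatest value of the measure."""
    return max(candidates, key=lambda acc: work_rate_measure(force, mass, acc))


# Hypothetical one-dimensional case: x'' may not fall below N = 0.
N = 0.0
grid = [(N + k / 100.0,) for k in range(1001)]    # admissible x'' from 0 to 10

lifted = greatest_admissible((3.0,), 2.0, grid)    # X/m = 1.5 exceeds N
pressed = greatest_admissible((-3.0,), 2.0, grid)  # X/m = -1.5 lies below N
```

In both cases the search returns the greater of N and X/m, in agreement
with the result obtained directly from formula (6) in the example of
the point on a surface.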

The formula (17), although in appearance less simple than (15), not
only is more easily enunciated in words, but has the advantage that
the quantity $\dfrac{d}{dt}\sum(P\dot{p})$ is entirely determined by the system with its
forces and motions, which is not the case with $\sum(P\ddot{p})$. The value of
the latter expression depends upon the manner in which we choose to
represent the forces. For example, if a material point is revolving in
a circle under the influence of a central force, we may write either
$X\ddot{x} + Y\ddot{y} + Z\ddot{z}$ or $R\ddot{r}$ for $P\ddot{p}$, $R$ and $r$ denoting respectively the force
and radius vector. Now $X\ddot{x} + Y\ddot{y} + Z\ddot{z}$ is manifestly unequal to $R\ddot{r}$.
But $X\dot{x} + Y\dot{y} + Z\dot{z}$ is equal to $R\dot{r}$, and $\dfrac{d}{dt}\left(X\dot{x} + Y\dot{y} + Z\dot{z}\right)$ is equal to
$\dfrac{d}{dt}\left(R\dot{r}\right)$.

It may not be without interest to see what shape our general
formulæ will take in one of the most important cases of forces
dependent upon the velocities. If a body which can be treated as a
point is moving in a medium which presents a resistance expressed by
any function of the velocity, the terms due to that resistance in the
general formula of motion may be expressed in the form

$$-\left[\varphi(v)\,\frac{\dot{x}}{v}\,\delta\ddot{x} + \varphi(v)\,\frac{\dot{y}}{v}\,\delta\ddot{y} + \varphi(v)\,\frac{\dot{z}}{v}\,\delta\ddot{z}\right],$$

where $v$ denotes the velocity and $\varphi(v)$ the resistance. But

$$\frac{\dot{x}}{v}\,\delta\ddot{x} + \frac{\dot{y}}{v}\,\delta\ddot{y} + \frac{\dot{z}}{v}\,\delta\ddot{z} = \delta\frac{dv}{dt}.$$

The terms due to the resistance reduce, therefore, to

$$-\delta\left[\varphi(v)\,\dot{v}\right],$$

or,

$$-\delta\,\frac{d}{dt}f(v),$$

where $f$ denotes the primitive of the function denoted by $\varphi$.

Discontinuous Changes of Velocity. Formula (9), which relates to
discontinuous changes of velocity, is capable of similar transformations.
If we set

$$w^2 = \Delta\dot{x}^2 + \Delta\dot{y}^2 + \Delta\dot{z}^2,$$

the formula reduces to

$$\delta\sum\left(X\,\Delta\dot{x} + Y\,\Delta\dot{y} + Z\,\Delta\dot{z} - \tfrac{1}{2}mw^2\right) \leq 0, \tag{18}$$

where $X, Y, Z$ are to be regarded as constant. If $\sum(P\,\delta p)$ represents
the sum of the moments of the impulsive forces, and we regard $P$ as
constant, we have

$$\delta\left[\sum\left(P\,\Delta\dot{p}\right) - \sum\left(\tfrac{1}{2}mw^2\right)\right] \leq 0. \tag{19}$$

The expressions affected by $\delta$ in these formulæ have a greater value
than they would receive from any other changes of velocity consistent
with the constraints of the system.


Deduction of other Properties of Motion.


The principles which have been established furnish a convenient
point of departure for the demonstration of various properties of
motion relating to maxima and minima. We may obtain several
such properties by considering how the accelerations of a system, at a
given instant, will be modified by changes of the forces or of the
constraints to which the system is subject. Let us suppose that the
forces $X, Y, Z$ of a system receive the increments $X', Y', Z'$, in
consequence of which, and of certain additional constraints, which do not
produce any discontinuity in the velocities, the components of
acceleration $\ddot{x}, \ddot{y}, \ddot{z}$ receive the increments $\ddot{x}', \ddot{y}', \ddot{z}'$. The expression

$$\sum\left[(X + X')(\ddot{x} + \ddot{x}') + (Y + Y')(\ddot{y} + \ddot{y}') + (Z + Z')(\ddot{z} + \ddot{z}') - \tfrac{1}{2}m\left\{(\ddot{x} + \ddot{x}')^2 + (\ddot{y} + \ddot{y}')^2 + (\ddot{z} + \ddot{z}')^2\right\}\right] \tag{20}$$

will be the greatest possible for any values of $\ddot{x}', \ddot{y}', \ddot{z}'$ consistent with
the constraints. But this expression may be divided into three parts,

$$\sum\left[(X + X')\ddot{x} + (Y + Y')\ddot{y} + (Z + Z')\ddot{z} - \tfrac{1}{2}m\left(\ddot{x}^2 + \ddot{y}^2 + \ddot{z}^2\right)\right], \tag{21}$$

$$\sum\left[X\ddot{x}' + Y\ddot{y}' + Z\ddot{z}' - m\left(\ddot{x}\ddot{x}' + \ddot{y}\ddot{y}' + \ddot{z}\ddot{z}'\right)\right], \tag{22}$$

and

$$\sum\left[X'\ddot{x}' + Y'\ddot{y}' + Z'\ddot{z}' - \tfrac{1}{2}m\left(\ddot{x}'^2 + \ddot{y}'^2 + \ddot{z}'^2\right)\right]. \tag{23}$$

The first part is evidently constant with reference to variations of
$\ddot{x}', \ddot{y}', \ddot{z}'$, and may, therefore, be neglected. With respect to the second
part we observe that by the general formula of the motion we have

$$\sum\left[X\,\delta\ddot{x} + Y\,\delta\ddot{y} + Z\,\delta\ddot{z} - m\left(\ddot{x}\,\delta\ddot{x} + \ddot{y}\,\delta\ddot{y} + \ddot{z}\,\delta\ddot{z}\right)\right] = 0$$

for all values of $\delta\ddot{x}, \delta\ddot{y}, \delta\ddot{z}$ which are possible and reversible before the
addition of the new constraints. But values proportional to $\ddot{x}', \ddot{y}', \ddot{z}'$,
and of the same sign, are evidently consistent with the original
constraints, and when the components of acceleration are altered to
$\ddot{x} + \ddot{x}'$, $\ddot{y} + \ddot{y}'$, $\ddot{z} + \ddot{z}'$, variations of these quantities proportional to and
of the same sign as $-\ddot{x}', -\ddot{y}', -\ddot{z}'$ are evidently consistent with the
original constraints. Now if these latter variations were not possible
before the accelerations were modified by the addition of the new forces
and constraints, it must be that some constraint was then operative
which afterwards ceased to be so. The expression (22) will, therefore,
be equal to zero, provided only that all the constraints which were
operative before the addition of the new forces and constraints, remain
operative afterwards.* With this limitation, therefore, the expression
(23) must have the greatest value consistent with the constraints.
This principle may be expressed without reference to rectangular
coordinates. If we write $u'$ for the relative acceleration due to the
additional forces and constraints, we have

$$u'^2 = \ddot{x}'^2 + \ddot{y}'^2 + \ddot{z}'^2,$$

and expression (23) reduces to

$$\sum\left(X'\ddot{x}' + Y'\ddot{y}' + Z'\ddot{z}' - \tfrac{1}{2}mu'^2\right). \tag{24}$$

If the sum of the moments of the additional forces which are
considered is represented by $\sum(Q\,\delta q)$ (the $q$ representing quantities
determined by the configuration of the system), we have

$$\sum\left(X'\dot{x} + Y'\dot{y} + Z'\dot{z}\right) = \sum\left(Q\dot{q}\right).$$

We may distinguish the values of $\ddot{q}$ immediately before and
immediately after the application of the additional forces and constraints
by the expressions $\ddot{q}$ and $\ddot{q} + \ddot{q}'$. With this understanding, we have,
by differentiation of the preceding equation,

$$\sum\left[\dot{X}'\dot{x} + \dot{Y}'\dot{y} + \dot{Z}'\dot{z} + X'(\ddot{x} + \ddot{x}') + Y'(\ddot{y} + \ddot{y}') + Z'(\ddot{z} + \ddot{z}')\right] = \sum\left[\dot{Q}\dot{q} + Q(\ddot{q} + \ddot{q}')\right];$$

whence it appears that $\sum(X'\ddot{x}' + Y'\ddot{y}' + Z'\ddot{z}')$ differs from $\sum(Q\ddot{q}')$ only
by quantities which are independent of the relative acceleration due
to the additional forces and constraints. It follows that these relative
accelerations are such as to make

$$\sum\left(Q\ddot{q}'\right) - \sum\left(\tfrac{1}{2}mu'^2\right) \tag{25}$$

a maximum.

*As an illustration of the significance of this limitation, we may consider the
condition afforded by the impenetrability of two bodies in contact. Let us suppose that if
subject only to the original forces and constraints they would continue in contact, but
that, under the influence of the additional forces and constraints, the contact will cease.
The impenetrability of the bodies then ceases to be operative as a constraint. Such
cases form an exception to the principle which is to be established. But there are no
exceptions when all the original constraints are expressed by equations.


It will be observed that the condition which determines these
relative accelerations is of precisely the same form as that which
determines absolute accelerations.

An important case is that in which new constraints are added but
no new forces. The relative accelerations are determined in this case
by the condition that $\sum(\tfrac{1}{2}mu'^2)$ is a minimum. In any case of motion,
in which finite forces do not act at points, lines or surfaces, we may
first calculate the accelerations which would be produced if there were
no constraints, and then determine the relative accelerations due to
the constraints by the condition that $\sum(\tfrac{1}{2}mu'^2)$ is a minimum. This
is Gauss's principle of least constraint.*
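
The two-step procedure just described (free accelerations first, then
the least departure from them that the constraints allow) may be
sketched in Python. The sketch rests on assumptions not found in the
paper: each particle has a single coordinate, the constraint is one
linear equation between the accelerations, and the minimum is found in
closed form with a Lagrange multiplier. All names and sample values are
hypothetical.

```python
def least_constraint(masses, forces, coeffs, rhs=0.0):
    """Minimise sum(1/2 * m * (a - F/m)**2) subject to sum(c * a) == rhs.

    One coordinate per particle; the single linear condition on the
    accelerations stands in for a constraint expressed by an equation.
    """
    free = [f / m for f, m in zip(forces, masses)]   # accelerations without constraint
    denom = sum(c * c / m for c, m in zip(coeffs, masses))
    # Lagrange multiplier of the constrained quadratic minimum.
    lam = (rhs - sum(c * a for c, a in zip(coeffs, free))) / denom
    return [a + lam * c / m for a, c, m in zip(free, coeffs, masses)]


def constraint_measure(masses, forces, accs):
    """Sum of 1/2 * m * u'^2, with u' the departure from the free acceleration."""
    return sum(0.5 * m * (a - f / m) ** 2 for m, f, a in zip(masses, forces, accs))


# Hypothetical example: two weights on a cord over a pulley, both coordinates
# measured downward, so that the two accelerations must sum to zero.
g = 10.0
masses = [3.0, 1.0]
forces = [m * g for m in masses]
accs = least_constraint(masses, forces, [1.0, 1.0])
```

For these sample values the heavier weight descends with half the free
acceleration, and any other pair of accelerations satisfying the
condition gives a larger value of the measure.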

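Again, in any case of motion, we may suppose $u$ to denote the
acceleration which would be produced by the constraints alone, and $u'$
the relative acceleration produced by the forces; we then have

$$\sum\left[m\left(\ddot{x}\ddot{x}' + \ddot{y}\ddot{y}' + \ddot{z}\ddot{z}'\right)\right] = 0,$$

whence, if we write $u''$ for the resultant or actual acceleration,

$$\sum\left(\tfrac{1}{2}mu^2\right) + \sum\left(\tfrac{1}{2}mu'^2\right) = \sum\left(\tfrac{1}{2}mu''^2\right).$$

Moreover, differentiating (25), we obtain

$$\sum\left(Q\,\delta\ddot{q}'\right) - \sum\left[m\left(\ddot{x}'\,\delta\ddot{x}' + \ddot{y}'\,\delta\ddot{y}' + \ddot{z}'\,\delta\ddot{z}'\right)\right] = 0,$$

whence, since $\delta\ddot{q}', \delta\ddot{x}', \delta\ddot{y}', \delta\ddot{z}'$ may have values proportional to
$\ddot{q}', \ddot{x}', \ddot{y}', \ddot{z}'$,

$$\sum\left(Q\ddot{q}'\right) = \sum\left(mu'^2\right).$$

These relations are similar to those which exist with respect to
vis viva and impulsive forces.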
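
These relations may be verified on a small numerical case. The Python
below is a sketch under assumptions of its own: each particle has one
coordinate, there is a single linear condition between the
accelerations with a nonzero right-hand side (standing for a constraint
that is a given function of the time), and "the constraints alone" is
read as the same condition with no forces acting. The names and the
data are hypothetical.

```python
def constrained_accelerations(masses, forces, coeffs, rhs):
    """Accelerations minimising sum(1/2 m (a - F/m)**2) subject to sum(c * a) == rhs."""
    free = [f / m for f, m in zip(forces, masses)]
    denom = sum(c * c / m for c, m in zip(coeffs, masses))
    lam = (rhs - sum(c * a for c, a in zip(coeffs, free))) / denom
    return [a + lam * c / m for a, c, m in zip(free, coeffs, masses)]


def half_m_square(masses, accs):
    """Sum of 1/2 * m * a^2 over the particles."""
    return sum(0.5 * m * a * a for m, a in zip(masses, accs))


# Hypothetical data: three particles and one condition on their accelerations.
masses = [1.0, 2.0, 4.0]
forces = [3.0, -1.0, 2.0]
coeffs = [1.0, 1.0, -2.0]
rhs = 6.0

u = constrained_accelerations(masses, [0.0, 0.0, 0.0], coeffs, rhs)  # constraints alone
u_total = constrained_accelerations(masses, forces, coeffs, rhs)     # actual acceleration u''
u_rel = [b - a for a, b in zip(u, u_total)]                          # relative acceleration u'

# Each of the three quantities below should vanish up to rounding.
cross_term = sum(m * a * b for m, a, b in zip(masses, u, u_rel))
energy_gap = (half_m_square(masses, u) + half_m_square(masses, u_rel)
              - half_m_square(masses, u_total))
work_gap = sum(f * a for f, a in zip(forces, u_rel)) - 2.0 * half_m_square(masses, u_rel)
```

The vanishing cross term is the first relation of the text, the energy
gap the second, and the work gap the last.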


*This principle may be derived very directly from the general formula (6), or vice 
versa, for $\Sigma(\tfrac{1}{2}mu'^2)$ may be put in the form 

$$\Sigma\,\tfrac{1}{2}m\left\{\left(\ddot{x}-\frac{X}{m}\right)^2+\left(\ddot{y}-\frac{Y}{m}\right)^2+\left(\ddot{z}-\frac{Z}{m}\right)^2\right\},$$

the variation of which, with the sign changed, is identical with the first member of (6). 


Particular Equations of Motion. 


From the general formula (12), we may easily obtain particular 
equations which will express the laws of motion in a very general 
form. 

Let $d\omega_1$, $d\omega_2$, etc. be infinitesimals (not necessarily complete 
differentials) the values of which are independent, and by means 
of which we can perfectly define any infinitesimal change in the 
configuration of the system; and let 

$$\dot{\omega}_1 = \frac{d\omega_1}{dt}, \quad \dot{\omega}_2 = \frac{d\omega_2}{dt}, \quad \text{etc.},$$

where $d\omega_1$, $d\omega_2$ are to be determined by the change in the con- 
figuration in the interval of time $dt$; and let 

$$\ddot{\omega}_1 = \frac{d\dot{\omega}_1}{dt}, \quad \ddot{\omega}_2 = \frac{d\dot{\omega}_2}{dt}, \quad \text{etc.}$$

Also let $U = \Sigma(\tfrac{1}{2}mu^2)$. 


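It is evident that $U$ can be expressed in terms of $\dot{\omega}_1$, $\dot{\omega}_2$, etc., 
$\ddot{\omega}_1$, $\ddot{\omega}_2$, etc., and the quantities which express the configuration of the 
system, and that (since $\delta$ is used to denote a variation which does 
not affect the configuration or the velocities), 

$$\delta U = \frac{dU}{d\ddot{\omega}_1}\,\delta\ddot{\omega}_1 + \frac{dU}{d\ddot{\omega}_2}\,\delta\ddot{\omega}_2 + \text{etc.}$$

Moreover, since the quantities $p$ in the general formula are entirely 
determined by the configuration of the system, 

$$\dot{p} = \frac{dp}{d\omega_1}\,\dot{\omega}_1 + \frac{dp}{d\omega_2}\,\dot{\omega}_2 + \text{etc.},$$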
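where $\frac{dp}{d\omega_1}$ denotes the ratio of simultaneous values of $dp$ and $d\omega_1$, 
when $d\omega_2$, etc. are equal to zero, and $\frac{dp}{d\omega_2}$, etc. are to be interpreted 
on the same principle. Multiplying by $P$, and taking the sum with 
respect to the several forces, we have 

$$\Sigma(P\dot{p}) = \Omega_1\dot{\omega}_1 + \Omega_2\dot{\omega}_2 + \text{etc.},$$

where 

$$\Omega_1 = \Sigma\left(P\frac{dp}{d\omega_1}\right), \quad \Omega_2 = \Sigma\left(P\frac{dp}{d\omega_2}\right), \quad \text{etc.}$$

If we differentiate with respect to $t$, and take the variation denoted 
by $\delta$, we obtain 

$$\Sigma(P\,\delta\ddot{p}) = \Omega_1\,\delta\ddot{\omega}_1 + \Omega_2\,\delta\ddot{\omega}_2 + \text{etc.}$$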
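The general formula (12) is thus reduced to the form 

$$\left(\Omega_1 - \frac{dU}{d\ddot{\omega}_1}\right)\delta\ddot{\omega}_1 + \left(\Omega_2 - \frac{dU}{d\ddot{\omega}_2}\right)\delta\ddot{\omega}_2 + \text{etc.} = 0. \tag{26}$$

If the forces have a potential $V$, we may write 

$$\left(\frac{dU}{d\ddot{\omega}_1} + \frac{dV}{d\omega_1}\right)\delta\ddot{\omega}_1 + \left(\frac{dU}{d\ddot{\omega}_2} + \frac{dV}{d\omega_2}\right)\delta\ddot{\omega}_2 + \text{etc.} = 0, \tag{27}$$

where $\frac{dV}{d\omega_1}$ denotes the ratio of $dV$ and $d\omega_1$ when $d\omega_2$, etc. have the 
value zero, and the analogous expressions are to be interpreted on 
the same principle. 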


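If the variations $\delta\ddot{\omega}_1$, $\delta\ddot{\omega}_2$, etc. are capable both of positive and of 
negative values, we must have 

$$\frac{dU}{d\ddot{\omega}_1} = \Omega_1, \quad \frac{dU}{d\ddot{\omega}_2} = \Omega_2, \quad \text{etc.}, \tag{28}$$

or, 

$$\frac{dU}{d\ddot{\omega}_1} = -\frac{dV}{d\omega_1}, \quad \frac{dU}{d\ddot{\omega}_2} = -\frac{dV}{d\omega_2}, \quad \text{etc.} \tag{29}$$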


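To illustrate the use of these equations in a case in which $d\omega_1$, $d\omega_2$, 
etc. are not exact differentials, we may apply them to the problem of 
the rotation of a rigid body of which one point is fixed. If $d\omega_1$, $d\omega_2$, 
$d\omega_3$ denote infinitesimal rotations about the principal axes which pass 
through the fixed point, $\Omega_1$, $\Omega_2$, $\Omega_3$ will denote the moments of the 
impressed forces about these axes, and the value of $U$ will be given 
by the formula 

$$\begin{aligned}
2U = {} & (a+b+c)(\dot{\omega}_1^2+\dot{\omega}_2^2+\dot{\omega}_3^2)^2 - (\dot{\omega}_1^2+\dot{\omega}_2^2+\dot{\omega}_3^2)(a\dot{\omega}_1^2+b\dot{\omega}_2^2+c\dot{\omega}_3^2) \\
& + 2(b-c)\,\dot{\omega}_2\dot{\omega}_3\ddot{\omega}_1 + 2(c-a)\,\dot{\omega}_3\dot{\omega}_1\ddot{\omega}_2 + 2(a-b)\,\dot{\omega}_1\dot{\omega}_2\ddot{\omega}_3 \\
& + (b+c)\,\ddot{\omega}_1^2 + (c+a)\,\ddot{\omega}_2^2 + (a+b)\,\ddot{\omega}_3^2,
\end{aligned}$$

where $a$, $b$, and $c$ are constants, $a+b$, $b+c$, $c+a$ being the moments 
of inertia about the three axes. Hence, 

$$\frac{dU}{d\ddot{\omega}_1} = (b-c)\,\dot{\omega}_2\dot{\omega}_3 + (b+c)\,\ddot{\omega}_1,$$
$$\frac{dU}{d\ddot{\omega}_2} = (c-a)\,\dot{\omega}_3\dot{\omega}_1 + (c+a)\,\ddot{\omega}_2,$$
$$\frac{dU}{d\ddot{\omega}_3} = (a-b)\,\dot{\omega}_1\dot{\omega}_2 + (a+b)\,\ddot{\omega}_3;$$

and the equations of motion are 

$$\ddot{\omega}_1 = \frac{(c-b)\,\dot{\omega}_2\dot{\omega}_3 + \Omega_1}{c+b},$$
$$\ddot{\omega}_2 = \frac{(a-c)\,\dot{\omega}_3\dot{\omega}_1 + \Omega_2}{a+c},$$
$$\ddot{\omega}_3 = \frac{(b-a)\,\dot{\omega}_1\dot{\omega}_2 + \Omega_3}{b+a}.$$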
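The expression for $U$ can be checked numerically. What follows is a sketch, not part of the original paper. It assumes a body made of a few point masses placed on the principal axes (so that the products of inertia vanish), takes $a=\Sigma mx^2$, $b=\Sigma my^2$, $c=\Sigma mz^2$ as the constants of the formula, and uses hypothetical function names. The acceleration of each particle is computed directly from the angular velocity `w` and the angular acceleration `wd`, and the resulting $\Sigma(\tfrac{1}{2}mu^2)$ is compared with the closed formula.

```python
def cross(p, q):
    """Skew (vector) product of two 3-component tuples."""
    return (p[1]*q[2] - p[2]*q[1], p[2]*q[0] - p[0]*q[2], p[0]*q[1] - p[1]*q[0])

def direct_U(particles, w, wd):
    """Sum of (1/2) m u^2, with u = wd x r + w x (w x r) for rotation about the origin."""
    total = 0.0
    for m, r in particles:
        tangential = cross(wd, r)
        centripetal = cross(w, cross(w, r))
        total += 0.5 * m * sum((t + c) ** 2 for t, c in zip(tangential, centripetal))
    return total

def formula_U(a, b, c, w, wd):
    """U from the closed expression for 2U given in the text."""
    w2 = w[0]**2 + w[1]**2 + w[2]**2
    two_U = ((a + b + c) * w2**2
             - w2 * (a*w[0]**2 + b*w[1]**2 + c*w[2]**2)
             + 2*(b - c)*w[1]*w[2]*wd[0]
             + 2*(c - a)*w[2]*w[0]*wd[1]
             + 2*(a - b)*w[0]*w[1]*wd[2]
             + (b + c)*wd[0]**2 + (c + a)*wd[1]**2 + (a + b)*wd[2]**2)
    return 0.5 * two_U

def euler_accelerations(a, b, c, w, moments):
    """Angular accelerations from the three equations of motion in the text."""
    return (((c - b)*w[1]*w[2] + moments[0]) / (c + b),
            ((a - c)*w[2]*w[0] + moments[1]) / (a + c),
            ((b - a)*w[0]*w[1] + moments[2]) / (b + a))

# Assumed test body: pairs of equal masses on the principal axes.
particles = [(2.0, (1.0, 0.0, 0.0)), (2.0, (-1.0, 0.0, 0.0)),
             (1.0, (0.0, 0.5, 0.0)), (1.0, (0.0, -0.5, 0.0)),
             (3.0, (0.0, 0.0, 2.0)), (3.0, (0.0, 0.0, -2.0))]
a = sum(m * r[0]**2 for m, r in particles)
b = sum(m * r[1]**2 for m, r in particles)
c = sum(m * r[2]**2 for m, r in particles)

w = (0.3, -1.2, 0.7)      # angular velocities about the three axes (assumed values)
wd = (2.0, 0.5, -1.0)     # angular accelerations (assumed values)
print(direct_U(particles, w, wd), formula_U(a, b, c, w, wd))
```

The two printed numbers should coincide up to rounding, and `euler_accelerations` reproduces Euler's equations with moments of inertia $b+c$, $c+a$, $a+b$.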


II. 


ON THE FUNDAMENTAL FORMULA OF STATISTICAL 
MECHANICS, WITH APPLICATIONS TO ASTRONOMY 
AND THERMODYNAMICS. 


[Proceedings of the American Association for the Advancement of Science, 
vol. XXXIII. pp. 57, 58, 1884.] 


(ABSTRACT. ) 


Suppose that we have a great number of systems which consist 
of material points and are identical in character, but different in 
configuration and velocities, and in which the forces are determined 
by the configuration alone. Let the number of systems in which the 
coordinates and velocities lie severally between the following limits, 
viz., between 

$x_1$ and $x_1+dx_1$, 
$y_1$ and $y_1+dy_1$, 
$z_1$ and $z_1+dz_1$, 
$x_2$ and $x_2+dx_2$, 
etc., 
$\dot{x}_1$ and $\dot{x}_1+d\dot{x}_1$, 
$\dot{y}_1$ and $\dot{y}_1+d\dot{y}_1$, 
$\dot{z}_1$ and $\dot{z}_1+d\dot{z}_1$, 
$\dot{x}_2$ and $\dot{x}_2+d\dot{x}_2$, 
etc., 

be denoted by 

$$L\,dx_1\,dy_1\,dz_1\,dx_2\ \text{etc.}\ d\dot{x}_1\,d\dot{y}_1\,d\dot{z}_1\,d\dot{x}_2\ \text{etc.}$$

The manner in which the quantity $L$ varies with the time is given 
by the equation 

$$\frac{dL}{dt} = -\Sigma\left[\frac{dL}{dx_1}\,\dot{x}_1 + \frac{dL}{d\dot{x}_1}\,\ddot{x}_1\right],$$

where $t$, $x_1$, $y_1$, $z_1$, $x_2$, etc., $\dot{x}_1$, $\dot{y}_1$, $\dot{z}_1$, $\dot{x}_2$, etc., are the independent 
variables, and the summation relates to all the coordinates. 

The object of the paper is to establish this proposition (which is not 
claimed as new, but which has hardly received the recognition which 
it deserves) and to show its applications to astronomy and thermo- 
dynamics. 
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ELEMENTS OF VECTOR ANALYSIS. 


[Privately printed, New Haven, pp. 17-50, 1881; pp. 50-90, 1884. ] 


(The fundamental principles of the following analysis are such as are familiar under 
a slightly different form to students of quaternions. The manner in which the subject 
is developed is somewhat different from that followed in treatises on quaternions, since 
the object of the writer does not require any use of the conception of the quaternion, 
being simply to give a suitable notation for those relations between vectors, or between 
vectors and scalars, which seem most important, and which lend themselves most readily 
to analytical transformations, and to explain some of these transformations. As a 
precedent for such a departure from quaternionic usage, Clifford’s Kinematic may be 
cited. In this connection, the name of Grassmann may also be mentioned, to whose 
system the following method attaches itself in some respects more closely than to that 
of Hamilton.) 


CHAPTER I. 


CONCERNING THE ALGEBRA OF VECTORS. 


Fundamental Notions. 


1. Definition.—If anything has magnitude and direction, its mag- 
nitude and direction taken together constitute what is called a vector. 

The numerical description of a vector requires three numbers, but 
nothing prevents us from using a single letter for its symbolical 
designation. An algebra or analytical method in which a single letter 
or other expression is used to specify a vector may be called a vector 
algebra or vector analysis. 

Def.—As distinguished from vectors the real (positive or negative) 
quantities of ordinary algebra are called scalars.* 

As it is convenient that the form of the letter should indicate 
whether a vector or a scalar is denoted, we shall use the small Greek 
letters to denote vectors, and the small English letters to denote 
scalars. (The three letters, $i$, $j$, $k$, will make an exception, to be 
mentioned more particularly hereafter. Moreover, $\pi$ will be used in 
its usual scalar sense, to denote the ratio of the circumference of 
a circle to its diameter.) 


*The imaginaries of ordinary algebra may be called biscalars, and that which cor- 
responds to them in the theory of vectors, bivectors. But we shall have no occasion to 
consider either of these. [See, however, footnote on p. 84.] 

2. Def.—Vectors are said to be equal when they are the same both 
in direction and in magnitude. This equality is denoted by the 
ordinary sign, as $\alpha=\beta$. The reader will observe that this vector 
equation is the equivalent of three scalar equations. 

A vector is said to be equal to zero, when its magnitude is zero. 
Such vectors may be set equal to one another, irrespectively of any 
considerations relating to direction. 

3. Perhaps the most simple example of a vector is afforded by a 
directed straight line, as the line drawn from A to B. We may use 
the notation AB to denote this line as a vector, i.e., to denote its 
length and direction without regard to its position in other respects. 
The points A and B may be distinguished as the origin and the 
terminus of the vector. Since any magnitude may be represented by 
a length, any vector may be represented by a directed line; and it 
will often be convenient to use language relating to vectors, which 
refers to them as thus represented. 


Reversal of Direction, Scalar Multiplication and Division. 


4. The negative sign (—) reverses the direction of a vector. (Some- 
times the sign + may be used to call attention to the fact that the 
vector has not the negative sign.) 

Def—A vector is said to be multiplied or divided by a scalar 
when its magnitude is multiplied or divided by the numerical value 
of the scalar and its direction is either unchanged or reversed 
according as the scalar is positive or negative. These operations 
are represented by the same methods as multiplication and division 
in algebra, and are to be regarded as substantially identical with 
them. The terms scalar multiplication and scalar division are used 
to denote multiplication and division by scalars, whether the quantity 
multiplied or divided is a scalar or a vector. 

5. Def—A unit vector is a vector of which the magnitude is unity. 

Any vector may be regarded as the product of a positive scalar 
(the magnitude of the vector) and a unit vector. 

The notation $\alpha_0$ may be used to denote the magnitude of the 
vector $\alpha$. 

Addition and Subtraction of Vectors. 


6. Def.—The sum of the vectors $\alpha$, $\beta$, etc. (written $\alpha+\beta+\text{etc.}$) is 
the vector found by the following process. Assuming any point A, 
we determine successively the points B, C, etc., so that AB$=\alpha$, 
BC$=\beta$, etc. The vector drawn from A to the last point thus deter- 
mined is the sum required. This is sometimes called the geometrical 
sum, to distinguish it from an algebraic sum or an arithmetical sum. 
It is also called the resultant, and $\alpha$, $\beta$, etc. are called the components. 
When the vectors to be added are all parallel to the same straight line, 
geometrical addition reduces to algebraic; when they have all the same 
direction, geometrical addition like algebraic reduces to arithmetical. 

It may easily be shown that the value of a sum is not affected 
by changing the order of two consecutive terms, and therefore that 
it is not affected by any change in the order of the terms. Again, it 
is evident from the definition that the value of a sum is not altered 
by uniting any of its terms in brackets, as $\alpha+[\beta+\gamma]+\text{etc.}$, which is 
in effect to substitute the sum of the terms enclosed for the terms 
themselves among the vectors to be added. In other words, the 
commutative and associative principles of arithmetical and algebraic 
addition hold true of geometrical addition. 

7. Def—A vector is said to be subtracted when it is added after 
reversal of direction. This is indicated by the use of the sign — 
instead of +. 

8. It is easily shown that the distributive principle of arithmetical 
and algebraic multiplication applies to the multiplication of sums of 
vectors by scalars or sums of scalars, i.e., 


$$(m+n+\text{etc.})[\alpha+\beta+\text{etc.}] = m\alpha+n\alpha+\text{etc.} + m\beta+n\beta+\text{etc.} + \text{etc.}$$

9. Vector Equations.—If we have equations between sums and 
differences of vectors, we may transpose terms in them, multiply or 
divide by any scalar, and add or subtract the equations, precisely as 
in the case of the equations of ordinary algebra. Hence, if we have 
several such equations containing known and unknown vectors, the 
processes of elimination and reduction by which the unknown vectors 
may be expressed in terms of the known are precisely the same, and 
subject to the same limitations, as if the letters representing vectors 
represented scalars. This will be evident if we consider that in the 
multiplications incident to elimination in the supposed scalar equations 
the multipliers are the coefficients of the unknown quantities, or 
functions of these coefficients, and that such multiplications may be 
applied to the vector equations, since the coefficients are scalars. 

10. Linear relation of four vectors, Coordinates.—If $\alpha$, $\beta$, and $\gamma$ 
are any given vectors not parallel to the same plane, any other vector 
$\rho$ may be expressed in the form 

$$\rho = a\alpha + b\beta + c\gamma.$$

If $\alpha$, $\beta$, and $\gamma$ are unit vectors, $a$, $b$, and $c$ are the ordinary scalar 
components of $\rho$ parallel to $\alpha$, $\beta$, and $\gamma$. If $\rho=$ OP, ($\alpha$, $\beta$, $\gamma$ being 
unit vectors), $a$, $b$, and $c$ are the cartesian coordinates of the point P 
referred to axes through O parallel to $\alpha$, $\beta$, and $\gamma$. When the values 
of these scalars are given, $\rho$ is said to be given in terms of $\alpha$, $\beta$, and $\gamma$. 
It is generally in this way that the value of a vector is specified, viz., 
in terms of three known vectors. For such purposes of reference, a 
system of three mutually perpendicular vectors has certain evident 
advantages. 

11. Normal systems of unit vectors.—The letters $i$, $j$, $k$ are appro- 
priated to the designation of a normal system of unit vectors, i.e., 
three unit vectors, each of which is at right angles to the other two 
and determined in direction by them in a perfectly definite manner. 
We shall always suppose that $k$ is on the side of the $i$-$j$ plane on 
which a rotation from $i$ to $j$ (through one right angle) appears 
counter-clockwise. In other words, the directions of $i$, $j$, and $k$ are 
to be so determined that if they be turned (remaining rigidly con- 
nected with each other) so that $i$ points to the east, and $j$ to the north, 
$k$ will point upward. When rectangular axes of X, Y, and Z are 
employed, their directions will be conformed to a similar condition, 
and $i$, $j$, $k$ (when the contrary is not stated) will be supposed parallel 
to these axes respectively. We may have occasion to use more than 
one such system of unit vectors, just as we may use more than one 
system of coordinate axes. In such cases, the different systems may 
be distinguished by accents or otherwise. 

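12. Numerical computation of a geometrical sum.—If 

$$\rho = a\alpha + b\beta + c\gamma,$$
$$\sigma = a'\alpha + b'\beta + c'\gamma,$$
$$\text{etc.},$$

then 

$$\rho+\sigma+\text{etc.} = (a+a'+\text{etc.})\,\alpha + (b+b'+\text{etc.})\,\beta + (c+c'+\text{etc.})\,\gamma,$$

i.e., the coefficients by which a geometrical sum is expressed in terms 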
of three vectors are the sums of the coefficients by which the separate 
terms of the geometrical sum are expressed in terms of the same 
three vectors. 


Direct and Skew Products of Vectors. 


13. Def.—The direct product of $\alpha$ and $\beta$ (written $\alpha.\beta$) is the scalar 
quantity obtained by multiplying the product of their magnitudes 
by the cosine of the angle made by their directions. 

14. Def.—The skew product of $\alpha$ and $\beta$ (written $\alpha\times\beta$) is a vector 
function of $\alpha$ and $\beta$. Its magnitude is obtained by multiplying the 
product of the magnitudes of $\alpha$ and $\beta$ by the sine of the angle made 
by their directions. Its direction is at right angles to $\alpha$ and $\beta$, and 
on that side of the plane containing $\alpha$ and $\beta$ (supposed drawn from 
a common origin) on which a rotation from $\alpha$ to $\beta$ through an arc 
of less than 180° appears counter-clockwise. 

The direction of $\alpha\times\beta$ may also be defined as that in which an 
ordinary screw advances as it turns so as to carry $\alpha$ toward $\beta$. 

Again, if $\alpha$ be directed toward the east, and $\beta$ lie in the same 
horizontal plane and on the north side of $\alpha$, $\alpha\times\beta$ will be directed 
upward. 


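15. It is evident from the preceding definitions that 

$$\alpha.\beta = \beta.\alpha, \quad\text{and}\quad \alpha\times\beta = -\beta\times\alpha.$$

16. Moreover, 

$$[n\alpha].\beta = \alpha.[n\beta] = n[\alpha.\beta],$$
$$\text{and}\quad [n\alpha]\times\beta = \alpha\times[n\beta] = n[\alpha\times\beta].$$

The brackets may therefore be omitted in such expressions. 

17. From the definitions of No. 11 it appears that 

$$i.i = j.j = k.k = 1,$$
$$i.j = j.i = i.k = k.i = j.k = k.j = 0,$$
$$i\times i = 0, \quad j\times j = 0, \quad k\times k = 0,$$
$$i\times j = k, \quad j\times k = i, \quad k\times i = j,$$
$$j\times i = -k, \quad k\times j = -i, \quad i\times k = -j.$$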
18. If we resolve $\beta$ into two components $\beta'$ and $\beta''$, of which the 
first is parallel and the second perpendicular to $\alpha$, we shall have 

$$\alpha.\beta = \alpha.\beta' \quad\text{and}\quad \alpha\times\beta = \alpha\times\beta''.$$

19. $\alpha.[\beta+\gamma] = \alpha.\beta + \alpha.\gamma$ and $\alpha\times[\beta+\gamma] = \alpha\times\beta + \alpha\times\gamma$. 

To prove this, let $\sigma=\beta+\gamma$, and resolve each of the vectors $\beta$, $\gamma$, $\sigma$ 
into two components, one parallel and the other perpendicular to $\alpha$. 
Let these be $\beta'$, $\beta''$, $\gamma'$, $\gamma''$, $\sigma'$, $\sigma''$. Then the equations to be proved 
will reduce by the last section to 

$$\alpha.\sigma' = \alpha.\beta' + \alpha.\gamma' \quad\text{and}\quad \alpha\times\sigma'' = \alpha\times\beta'' + \alpha\times\gamma''.$$

Now since $\sigma=\beta+\gamma$ we may form a triangle in space, the sides of 
which shall be $\beta$, $\gamma$, and $\sigma$. Projecting this on a plane perpendicular 
to $\alpha$, we obtain a triangle having the sides $\beta''$, $\gamma''$, and $\sigma''$, which 
affords the relation $\sigma''=\beta''+\gamma''$. If we pass planes perpendicular 
to $\alpha$ through the vertices of the first triangle, they will give on a 
line parallel to $\alpha$ segments equal to $\beta'$, $\gamma'$, $\sigma'$. Thus we obtain the 
relation $\sigma'=\beta'+\gamma'$. Therefore $\alpha.\sigma'=\alpha.\beta'+\alpha.\gamma'$, since all the cosines 
involved in these products are equal to unity. Moreover, if $\alpha$ is a 
unit vector, we shall evidently have $\alpha\times\sigma''=\alpha\times\beta''+\alpha\times\gamma''$, since the 
effect of the skew multiplication by $\alpha$ upon vectors in a plane 
perpendicular to $\alpha$ is simply to rotate them all 90° in that plane. 
But any case may be reduced to this by dividing both sides of the 
equation to be proved by the magnitude of $\alpha$. The propositions 
are therefore proved. 

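20. Hence, 

$$[\alpha+\beta].\gamma = \alpha.\gamma+\beta.\gamma, \qquad [\alpha+\beta]\times\gamma = \alpha\times\gamma+\beta\times\gamma,$$
$$[\alpha+\beta].[\gamma+\delta] = \alpha.\gamma+\alpha.\delta+\beta.\gamma+\beta.\delta,$$
$$[\alpha+\beta]\times[\gamma+\delta] = \alpha\times\gamma+\alpha\times\delta+\beta\times\gamma+\beta\times\delta;$$

and, in general, direct and skew products of sums of vectors may be 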
expanded precisely as the products of sums in algebra, except that 
in skew products the order of the factors must not be changed 
without compensation in the sign of the term. If any of the terms 
in the factors have negative signs, the signs of the expanded product 
(when there is no change in the order of the factors) will be deter- 
mined by the same rules as in algebra. It is on account of this 
analogy with algebraic products that these functions of vectors are 
called products and that other terms relating to multiplication are 
applied to them. 

21. Numerical calculation of direct and skew products.—The 
properties demonstrated in the last two paragraphs (which may be 
briefly expressed by saying that the operations of direct and skew 
multiplication are distributive) afford the rule for the numerical 
calculation of a direct product, or of the components of a skew 
product, when the rectangular components of the factors are given 
numerically. In fact, if 

$$\alpha = xi+yj+zk, \quad\text{and}\quad \beta = x'i+y'j+z'k;$$
$$\alpha.\beta = xx'+yy'+zz',$$
$$\text{and}\quad \alpha\times\beta = (yz'-zy')\,i + (zx'-xz')\,j + (xy'-yx')\,k.$$
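The rule of No. 21 translates directly into a few lines of code. The sketch below is only an illustration. It represents a vector by the tuple of its rectangular components, and the function names `direct` and `skew` are chosen here to echo the text's terminology.

```python
def direct(a, b):
    """Direct product a.b from rectangular components: xx' + yy' + zz'."""
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def skew(a, b):
    """Skew product a x b: (yz' - zy', zx' - xz', xy' - yx')."""
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
alpha = (1, 2, 3)
beta = (4, 5, 6)

print(direct(alpha, beta))   # 32
print(skew(alpha, beta))     # (-3, 6, -3)
print(skew(i, j) == k)       # True, as in No. 17
```

The skew product is at right angles to both factors, so its direct product with either of them is zero.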

22. Representation of the area of a parallelogram by a skew 
product.—It will be easily seen that $\alpha\times\beta$ represents in magnitude 
the area of the parallelogram of which $\alpha$ and $\beta$ (supposed drawn 
from a common origin) are the sides, and that it represents in direc- 
tion the normal to the plane of the parallelogram on the side on 
which the rotation from $\alpha$ toward $\beta$ appears counter-clockwise. 

23. Representation of the volume of a parallelopiped by a triple 
product.—It will also be seen that $\alpha\times\beta.\gamma$* represents in numerical 
value the volume of the parallelopiped of which $\alpha$, $\beta$, and $\gamma$ (supposed 
drawn from a common origin) are the edges, and that the value of 
the expression is positive or negative according as $\gamma$ lies on the side 
of the plane of $\alpha$ and $\beta$ on which the rotation from $\alpha$ to $\beta$ appears 
counter-clockwise, or on the opposite side. 
24. Hence, 

$$\begin{aligned}
\alpha\times\beta.\gamma &= \beta\times\gamma.\alpha = \gamma\times\alpha.\beta = \gamma.\alpha\times\beta = \alpha.\beta\times\gamma \\
&= \beta.\gamma\times\alpha = -\beta\times\alpha.\gamma = -\gamma\times\beta.\alpha = -\alpha\times\gamma.\beta \\
&= -\gamma.\beta\times\alpha = -\alpha.\gamma\times\beta = -\beta.\alpha\times\gamma.
\end{aligned}$$

It will be observed that all the products of this type, which can be 
made with three given vectors, are the same in numerical value, and 
that any two such products are of the same or opposite character in 
respect to sign, according as the cyclic order of the letters is the same 
or different. The product vanishes when two of the vectors are 
parallel to the same line, or when the three are parallel to the same 
plane. 

* Since the sign $\times$ is only used between vectors, the skew multiplication in expressions 
of this kind is evidently to be performed first. In other words, the above expression 
must be interpreted as $[\alpha\times\beta].\gamma$. 
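The statements of Nos. 23 and 24 are easy to confirm on a numerical instance. The following is a sketch with arbitrarily chosen integer vectors, and `triple` is a name introduced here for the product $\alpha\times\beta.\gamma$.

```python
def direct(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def skew(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def triple(a, b, c):
    """The product a x b . c, the skew multiplication being performed first."""
    return direct(skew(a, b), c)

alpha, beta, gamma = (1, 2, 3), (0, 1, 4), (5, 6, 1)

volume = triple(alpha, beta, gamma)
print(volume)                          # 2
print(triple(beta, gamma, alpha))      # 2: same cyclic order
print(triple(beta, alpha, gamma))      # -2: cyclic order reversed

# Three vectors parallel to one plane: the product vanishes.
coplanar = tuple(p + q for p, q in zip(alpha, beta))
print(triple(alpha, beta, coplanar))   # 0
```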

This kind of product may be called the scalar product of the three 
vectors. There are two other kinds of products of three vectors, both 
of which are vectors, viz., products of the type $(\alpha.\beta)\gamma$ or $\gamma(\alpha.\beta)$, and 
products of the type $\alpha\times[\beta\times\gamma]$ or $[\gamma\times\beta]\times\alpha$. 

25. $i.j\times k = j.k\times i = k.i\times j = 1$, $\quad i.k\times j = k.j\times i = j.i\times k = -1$. 

From these equations, which follow immediately from those of 
No. 17, the propositions of the last section might have been derived, 
viz., by substituting for $\alpha$, $\beta$, and $\gamma$, respectively, expressions of the 
form $xi+yj+zk$, $x'i+y'j+z'k$, and $x''i+y''j+z''k$.* Such a method, 
which may be called expansion in terms of $i$, $j$, and $k$, will on many 
occasions afford very simple, although perhaps lengthy, demonstrations. 

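26. Triple products containing only two different letters.—The 
significance and the relations of $(\alpha.\alpha)\beta$, $(\alpha.\beta)\alpha$, and $\alpha\times[\alpha\times\beta]$ will 
be most evident, if we consider $\beta$ as made up of two components, 
$\beta'$ and $\beta''$, respectively parallel and perpendicular to $\alpha$. Then 

$$\beta = \beta'+\beta'',$$
$$(\alpha.\beta)\alpha = (\alpha.\beta')\alpha = (\alpha.\alpha)\beta',$$
$$\alpha\times[\alpha\times\beta] = \alpha\times[\alpha\times\beta''] = -(\alpha.\alpha)\beta''.$$

Hence, 

$$\alpha\times[\alpha\times\beta] = (\alpha.\beta)\alpha - (\alpha.\alpha)\beta.$$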

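27. General relation of the vector products of three factors.—In the 
triple product $\alpha\times[\beta\times\gamma]$ we may set 

$$\alpha = l\beta + m\gamma + n\,\beta\times\gamma,$$

unless $\beta$ and $\gamma$ have the same direction. Then 

$$\begin{aligned}
\alpha\times[\beta\times\gamma] &= l\,\beta\times[\beta\times\gamma] + m\,\gamma\times[\beta\times\gamma] \\
&= l(\beta.\gamma)\beta - l(\beta.\beta)\gamma - m(\gamma.\beta)\gamma + m(\gamma.\gamma)\beta \\
&= (l\,\beta.\gamma + m\,\gamma.\gamma)\beta - (l\,\beta.\beta + m\,\gamma.\beta)\gamma.
\end{aligned}$$

But $l\,\beta.\gamma + m\,\gamma.\gamma = \alpha.\gamma$, and $l\,\beta.\beta + m\,\gamma.\beta = \alpha.\beta$. 
Therefore 

$$\alpha\times[\beta\times\gamma] = (\alpha.\gamma)\beta - (\alpha.\beta)\gamma,$$

which is evidently true, when $\beta$ and $\gamma$ have the same directions. It 
may also be written 

$$[\gamma\times\beta]\times\alpha = \beta(\gamma.\alpha) - \gamma(\beta.\alpha).$$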
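As a check on the general relation of No. 27, the two members can be evaluated for particular vectors. This is a sketch with arbitrarily chosen integer components, and the helper names are hypothetical.

```python
def direct(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def skew(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def scale(n, a):
    return tuple(n * x for x in a)

def minus(a, b):
    return tuple(x - y for x, y in zip(a, b))

alpha, beta, gamma = (1, 2, 3), (0, 1, 4), (5, 6, 1)

left = skew(alpha, skew(beta, gamma))
right = minus(scale(direct(alpha, gamma), beta),
              scale(direct(alpha, beta), gamma))
print(left, right)   # both (-70, -64, 66)
```

With integer components the comparison is exact, so no rounding tolerance is needed.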


* The student who is familiar with the nature of determinants will not fail to observe 
that the triple product $\alpha.\beta\times\gamma$ is the determinant formed by the nine rectangular 
components of $\alpha$, $\beta$, and $\gamma$, nor that the rectangular components of $\alpha\times\beta$ are determinants 
of the second order formed from the components of $\alpha$ and $\beta$. (See the last equation of 
No. 21.) 

28. This principle may be used in the transformation of more 
complex products. It will be observed that its application will 
always simultaneously eliminate, or introduce, two signs of skew 
multiplication. 

The student will easily prove the following identical equations, 
which, although of considerable importance, are here given principally 
as exercises in the application of the preceding formulæ. 


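29. $\alpha\times[\beta\times\gamma] + \beta\times[\gamma\times\alpha] + \gamma\times[\alpha\times\beta] = 0$. 

30. $[\alpha\times\beta].[\gamma\times\delta] = (\alpha.\gamma)(\beta.\delta) - (\alpha.\delta)(\beta.\gamma)$. 

31. $[\alpha\times\beta]\times[\gamma\times\delta] = (\alpha.\gamma\times\delta)\beta - (\beta.\gamma\times\delta)\alpha$ 
$\qquad = (\alpha.\beta\times\delta)\gamma - (\alpha.\beta\times\gamma)\delta$. 

32. $\alpha\times[\beta\times[\gamma\times\delta]] = (\alpha.\gamma\times\delta)\beta - (\alpha.\beta)\,\gamma\times\delta$ 
$\qquad = (\beta.\delta)\,\alpha\times\gamma - (\beta.\gamma)\,\alpha\times\delta$. 

33. $[\alpha\times\beta].[\gamma\times\delta]\times[\epsilon\times\zeta] = (\alpha.\beta\times\delta)(\gamma.\epsilon\times\zeta) - (\alpha.\beta\times\gamma)(\delta.\epsilon\times\zeta)$ 
$\qquad = (\alpha.\beta\times\epsilon)(\zeta.\gamma\times\delta) - (\alpha.\beta\times\zeta)(\epsilon.\gamma\times\delta)$ 
$\qquad = (\gamma.\delta\times\alpha)(\beta.\epsilon\times\zeta) - (\gamma.\delta\times\beta)(\alpha.\epsilon\times\zeta)$. 

34. $[\alpha\times\beta].[\beta\times\gamma]\times[\gamma\times\alpha] = (\alpha.\beta\times\gamma)^2$. 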
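Before attempting the proofs, the student may find it reassuring to test a few of these identities on numbers. The sketch below checks Nos. 29, 30, 31 and 34 for one arbitrary choice of integer vectors. The helper names are hypothetical, and a numerical agreement of this kind is of course an illustration, not a demonstration.

```python
def direct(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def skew(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def scale(n, a):
    return tuple(n * x for x in a)

def plus(*vectors):
    return tuple(sum(parts) for parts in zip(*vectors))

def minus(a, b):
    return tuple(x - y for x, y in zip(a, b))

alpha, beta, gamma, delta = (1, 2, 3), (0, 1, 4), (5, 6, 1), (2, -1, 3)

# No. 29: the three vector triple products sum to zero.
id29 = plus(skew(alpha, skew(beta, gamma)),
            skew(beta, skew(gamma, alpha)),
            skew(gamma, skew(alpha, beta)))

# No. 30: direct product of two skew products.
lhs30 = direct(skew(alpha, beta), skew(gamma, delta))
rhs30 = direct(alpha, gamma)*direct(beta, delta) - direct(alpha, delta)*direct(beta, gamma)

# No. 31 (first form): skew product of two skew products.
gd = skew(gamma, delta)
lhs31 = skew(skew(alpha, beta), gd)
rhs31 = minus(scale(direct(alpha, gd), beta), scale(direct(beta, gd), alpha))

# No. 34: the triple product of the three skew products is a square.
lhs34 = direct(skew(alpha, beta), skew(skew(beta, gamma), skew(gamma, alpha)))
rhs34 = direct(alpha, skew(beta, gamma)) ** 2

print(id29, lhs30 == rhs30, lhs31 == rhs31, lhs34 == rhs34)
```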

35. The student will also easily convince himself that a product 
formed of any number of letters (representing vectors) combined in 
any possible way by scalar, direct, and skew multiplications may be 
reduced by the principles of Nos. 24 and 27 to a sum of products, 
each of which consists of scalar factors of the forms $\alpha.\beta$ and $\alpha.\beta\times\gamma$, 
with a single vector factor of the form $\alpha$ or $\alpha\times\beta$, when the original 
product is a vector. 

36. Elimination of scalars from vector equations.—It has already 
been observed that the elimination of vectors from equations of the 
form 

$$a\alpha + b\beta + c\gamma + d\delta + \text{etc.} = 0$$

is performed by the same rule as the eliminations of ordinary algebra. 
(See No. 9.) But the elimination of scalars from such equations is at 
least formally different. Since a single vector equation is the equi- 
valent of three scalar equations, we must be able to deduce from such 
an equation a scalar equation from which two of the scalars which 
appear in the original vector equation have been eliminated. We 
shall see how this may be done, if we consider the scalar equation 


$$a\,\alpha.\lambda + b\,\beta.\lambda + c\,\gamma.\lambda + d\,\delta.\lambda + \text{etc.} = 0,$$

which is derived from the above vector equation by direct multipli- 
cation by a vector $\lambda$. We may regard the original equation as the 
equivalent of the three scalar equations obtained by substituting for 
$\alpha$, $\beta$, $\gamma$, $\delta$, etc., their X-, Y-, and Z-components. The second equation 
would be derived from these by multiplying them respectively by 
the X-, Y-, and Z-components of $\lambda$ and adding. Hence the second 
equation may be regarded as the most general form of a scalar equa- 
tion of the first degree in $a$, $b$, $c$, $d$, etc., which can be derived from 
the original vector equation or its equivalent three scalar equations. 
If we wish to have two of the scalars, as $b$ and $c$, disappear, we have 
only to choose for $\lambda$ a vector perpendicular to $\beta$ and $\gamma$. Such a 
vector is $\beta\times\gamma$. We thus obtain 

$$a\,\alpha.\beta\times\gamma + d\,\delta.\beta\times\gamma + \text{etc.} = 0.$$
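37. Relations of four vectors.—By this method of elimination we 
may find the values of the coefficients $a$, $b$, and $c$ in the equation 

$$\rho = a\alpha + b\beta + c\gamma, \tag{1}$$

by which any vector $\rho$ is expressed in terms of three others. (See 
No. 10.) If we multiply directly by $\beta\times\gamma$, $\gamma\times\alpha$, and $\alpha\times\beta$, we obtain 

$$\rho.\beta\times\gamma = a\,\alpha.\beta\times\gamma, \quad \rho.\gamma\times\alpha = b\,\beta.\gamma\times\alpha, \quad \rho.\alpha\times\beta = c\,\gamma.\alpha\times\beta; \tag{2}$$

whence 

$$a = \frac{\rho.\beta\times\gamma}{\alpha.\beta\times\gamma}, \quad b = \frac{\rho.\gamma\times\alpha}{\alpha.\beta\times\gamma}, \quad c = \frac{\rho.\alpha\times\beta}{\alpha.\beta\times\gamma}. \tag{3}$$

By substitution of these values, we obtain the identical equation, 

$$(\alpha.\beta\times\gamma)\,\rho = (\rho.\beta\times\gamma)\,\alpha + (\rho.\gamma\times\alpha)\,\beta + (\rho.\alpha\times\beta)\,\gamma. \tag{4}$$

(Compare No. 31.) If we wish the four vectors to appear symmetri- 
cally in the equation we may write 

$$(\alpha.\beta\times\gamma)\,\rho - (\beta.\gamma\times\rho)\,\alpha + (\gamma.\rho\times\alpha)\,\beta - (\rho.\alpha\times\beta)\,\gamma = 0. \tag{5}$$

If we wish to express $\rho$ as a sum of vectors having directions 
perpendicular to the planes of $\alpha$ and $\beta$, of $\beta$ and $\gamma$, and of $\gamma$ and $\alpha$, 
we may write 

$$\rho = e\,\beta\times\gamma + f\,\gamma\times\alpha + g\,\alpha\times\beta. \tag{6}$$

To obtain the values of $e$, $f$, $g$, we multiply directly by $\alpha$, by $\beta$, and 
by $\gamma$. This gives 

$$e = \frac{\rho.\alpha}{\beta.\gamma\times\alpha}, \quad f = \frac{\rho.\beta}{\gamma.\alpha\times\beta}, \quad g = \frac{\rho.\gamma}{\alpha.\beta\times\gamma}. \tag{7}$$

Substituting these values we obtain the identical equation 

$$(\alpha.\beta\times\gamma)\,\rho = (\rho.\alpha)\,\beta\times\gamma + (\rho.\beta)\,\gamma\times\alpha + (\rho.\gamma)\,\alpha\times\beta. \tag{8}$$

(Compare No. 32.) 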
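38. Reciprocal systems of vectors.—The results of the preceding 
section may be more compactly expressed if we use the abbreviations 

$$\alpha' = \frac{\beta\times\gamma}{\alpha.\beta\times\gamma}, \quad \beta' = \frac{\gamma\times\alpha}{\beta.\gamma\times\alpha}, \quad \gamma' = \frac{\alpha\times\beta}{\gamma.\alpha\times\beta}. \tag{1}$$

The identical equations (4) and (8) of the preceding number thus 
become 

$$\rho = (\rho.\alpha')\,\alpha + (\rho.\beta')\,\beta + (\rho.\gamma')\,\gamma, \tag{2}$$
$$\rho = (\rho.\alpha)\,\alpha' + (\rho.\beta)\,\beta' + (\rho.\gamma)\,\gamma'. \tag{3}$$

We may infer from the similarity of these equations that the relations 
of $\alpha$, $\beta$, $\gamma$, and $\alpha'$, $\beta'$, $\gamma'$ are reciprocal, a proposition which is easily 
proved directly. For the equations 

$$\alpha = \frac{\beta'\times\gamma'}{\alpha'.\beta'\times\gamma'}, \quad \beta = \frac{\gamma'\times\alpha'}{\beta'.\gamma'\times\alpha'}, \quad \gamma = \frac{\alpha'\times\beta'}{\gamma'.\alpha'\times\beta'} \tag{4}$$

are satisfied identically by the substitution of the values of $\alpha'$, $\beta'$, 
and $\gamma'$ given in equations (1). (See Nos. 31 and 34.) 

Def.—It will be convenient to use the term reciprocal to designate 
these relations, i.e., we shall say that three vectors are reciprocals of 
three others, when they satisfy relations similar to those expressed in 
equations (1) or (4). 

With this understanding we may say :— 

The coefficients by which any vector is expressed in terms of 
three other vectors are the direct products of that vector with the 
reciprocals of the three. 

Among other relations which are satisfied by reciprocal systems 
of vectors are the following: 

$$\alpha.\alpha' = \beta.\beta' = \gamma.\gamma' = 1,$$
$$\alpha.\beta' = 0, \quad \alpha.\gamma' = 0, \quad \beta.\alpha' = 0, \quad \beta.\gamma' = 0, \quad \gamma.\alpha' = 0, \quad \gamma.\beta' = 0. \tag{5}$$

These nine equations may be regarded as defining the relations 
between $\alpha$, $\beta$, $\gamma$ and $\alpha'$, $\beta'$, $\gamma'$ as reciprocals. 

$$(\alpha.\beta\times\gamma)(\alpha'.\beta'\times\gamma') = 1. \tag{6}$$

(See No. 34.) 

$$\alpha\times\alpha' + \beta\times\beta' + \gamma\times\gamma' = 0. \tag{7}$$

(See No. 29.) 

A system of three mutually perpendicular unit vectors is reciprocal 
to itself, and only such a system. 

The identical equation 

$$\rho = (\rho.i)\,i + (\rho.j)\,j + (\rho.k)\,k \tag{8}$$

may be regarded as a particular case of equation (2). 

The system reciprocal to $\alpha\times\beta$, $\beta\times\gamma$, $\gamma\times\alpha$ is 

$$\alpha'\times\beta', \quad \beta'\times\gamma', \quad \gamma'\times\alpha',$$

or 

$$\frac{\alpha}{\alpha.\beta\times\gamma}, \quad \frac{\beta}{\alpha.\beta\times\gamma}, \quad \frac{\gamma}{\alpha.\beta\times\gamma}.$$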
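The reciprocal system is simple to compute from equations (1). The following sketch uses exact rational arithmetic from Python's standard `fractions` module so that the relations (5) and the expansion (2) can be confirmed without rounding. The function names and the sample vectors are assumptions made for the illustration.

```python
from fractions import Fraction

def direct(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def skew(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def scale(n, a):
    return tuple(n * x for x in a)

def plus(*vectors):
    return tuple(sum(parts) for parts in zip(*vectors))

def reciprocal_system(alpha, beta, gamma):
    """Reciprocals alpha', beta', gamma' according to equations (1) of No. 38."""
    volume = direct(alpha, skew(beta, gamma))
    if volume == 0:
        raise ValueError("the three vectors are parallel to the same plane")
    inverse = Fraction(1) / volume
    return (scale(inverse, skew(beta, gamma)),
            scale(inverse, skew(gamma, alpha)),
            scale(inverse, skew(alpha, beta)))

alpha, beta, gamma = (1, 2, 3), (0, 1, 4), (5, 6, 1)
alpha_r, beta_r, gamma_r = reciprocal_system(alpha, beta, gamma)

# Coefficients of rho in terms of alpha, beta, gamma, by equation (2).
rho = (7, -1, 2)
a, b, c = direct(rho, alpha_r), direct(rho, beta_r), direct(rho, gamma_r)
rebuilt = plus(scale(a, alpha), scale(b, beta), scale(c, gamma))
print(a, b, c, rebuilt == rho)
```

Applying `reciprocal_system` to the reciprocals returns the original three vectors, which is the sense of the term "reciprocal".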
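38a. If we multiply the identical equation (8) of No. 37 by $\sigma\times\tau$, 
we obtain the equation 

$$\begin{aligned}
(\alpha.\beta\times\gamma)(\rho.\sigma\times\tau) = {} & \alpha.\rho\,(\beta.\sigma\ \gamma.\tau - \beta.\tau\ \gamma.\sigma) \\
& + \beta.\rho\,(\gamma.\sigma\ \alpha.\tau - \gamma.\tau\ \alpha.\sigma) + \gamma.\rho\,(\alpha.\sigma\ \beta.\tau - \alpha.\tau\ \beta.\sigma),
\end{aligned}$$

which is therefore identical. But this equation cannot subsist identi- 
cally, unless 

$$\begin{aligned}
(\alpha.\beta\times\gamma)\,\sigma\times\tau = {} & \alpha\,(\beta.\sigma\ \gamma.\tau - \beta.\tau\ \gamma.\sigma) \\
& + \beta\,(\gamma.\sigma\ \alpha.\tau - \gamma.\tau\ \alpha.\sigma) + \gamma\,(\alpha.\sigma\ \beta.\tau - \alpha.\tau\ \beta.\sigma)
\end{aligned}$$

is also an identical equation. (The reader will observe that in each 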
of these equations the second member may be expressed as a deter- 
minant.) 

From these transformations, with those already given, it follows 
that a product formed of any number of letters (representing vectors 
and scalars), combined in any possible way by scalar, direct, and skew 
multiplications, may be reduced to a sum of products, containing 
each the sign x once and only once, when the original product 
contains it an odd number of times, or entirely free from the sign, 
when the original product contains it an even number of times. 

39. Scalar equations of the first degree with respect to an unknown 
vector.—It is easily shown that any scalar equation of the first 
degree with respect to an unknown vector $\rho$, in which all the other 
quantities are known, may be reduced to the form 

$$\rho.\alpha = a,$$

in which $\alpha$ and $a$ are known. (See No. 35.) Three such equations 
will afford the value of $\rho$ (by equation (8) of No. 37, or equation (3) 
of No. 38), which may be used to eliminate $\rho$ from any other equation 
either scalar or vector. 

When we have four scalar equations of the first degree with 
respect to $\rho$, the elimination may be performed most symmetrically 
by substituting the values of $\rho.\alpha$, etc., in the equation 

$$(\rho.\alpha)(\beta.\gamma\times\delta) - (\rho.\beta)(\gamma.\delta\times\alpha) + (\rho.\gamma)(\delta.\alpha\times\beta) - (\rho.\delta)(\alpha.\beta\times\gamma) = 0,$$

which is obtained from equation (8) of No. 37 by multiplying 
directly by $\delta$. It may also be obtained from equation (5) of No. 37 
by writing $\delta$ for $\rho$, and then multiplying directly by $\rho$. 

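40. Solution of a vector equation of the first degree with respect to 
the unknown vector.—It is now easy to solve an equation of the 
form 

$$\delta = \alpha(\lambda.\rho) + \beta(\mu.\rho) + \gamma(\nu.\rho), \tag{1}$$

where $\alpha$, $\beta$, $\gamma$, $\delta$, $\lambda$, $\mu$, and $\nu$ represent known vectors. Multiplying 
directly by $\beta\times\gamma$, by $\gamma\times\alpha$, and by $\alpha\times\beta$, we obtain 

$$\beta.\gamma\times\delta = (\beta.\gamma\times\alpha)(\lambda.\rho), \quad \gamma.\alpha\times\delta = (\gamma.\alpha\times\beta)(\mu.\rho),$$
$$\alpha.\beta\times\delta = (\alpha.\beta\times\gamma)(\nu.\rho);$$

or 

$$\alpha'.\delta = \lambda.\rho, \quad \beta'.\delta = \mu.\rho, \quad \gamma'.\delta = \nu.\rho,$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$. Substituting these 
values in the identical equation 

$$\rho = \lambda'(\lambda.\rho) + \mu'(\mu.\rho) + \nu'(\nu.\rho),$$

in which $\lambda'$, $\mu'$, $\nu'$ are the reciprocals of $\lambda$, $\mu$, $\nu$ (see No. 38), we have 

$$\rho = \lambda'(\alpha'.\delta) + \mu'(\beta'.\delta) + \nu'(\gamma'.\delta), \tag{2}$$

which is the solution required. 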
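The solution (2) lends itself to a direct numerical trial. In the sketch below an unknown vector is chosen in advance, the right-hand member of equation (1) is formed from it, and the vector is then recovered by formula (2). Exact rational arithmetic is used, and the function names and the sample vectors are assumptions introduced only for this illustration.

```python
from fractions import Fraction

def direct(a, b):
    return a[0]*b[0] + a[1]*b[1] + a[2]*b[2]

def skew(a, b):
    return (a[1]*b[2] - a[2]*b[1],
            a[2]*b[0] - a[0]*b[2],
            a[0]*b[1] - a[1]*b[0])

def scale(n, a):
    return tuple(n * x for x in a)

def plus(*vectors):
    return tuple(sum(parts) for parts in zip(*vectors))

def reciprocal_system(alpha, beta, gamma):
    """Reciprocals of three vectors not parallel to the same plane (No. 38)."""
    volume = direct(alpha, skew(beta, gamma))
    if volume == 0:
        raise ValueError("the three vectors are parallel to the same plane")
    inverse = Fraction(1) / volume
    return (scale(inverse, skew(beta, gamma)),
            scale(inverse, skew(gamma, alpha)),
            scale(inverse, skew(alpha, beta)))

def solve_first_degree(alpha, beta, gamma, lam, mu, nu, delta):
    """rho from delta = alpha(lam.rho) + beta(mu.rho) + gamma(nu.rho), by formula (2)."""
    alpha_r, beta_r, gamma_r = reciprocal_system(alpha, beta, gamma)
    lam_r, mu_r, nu_r = reciprocal_system(lam, mu, nu)
    return plus(scale(direct(alpha_r, delta), lam_r),
                scale(direct(beta_r, delta), mu_r),
                scale(direct(gamma_r, delta), nu_r))

alpha, beta, gamma = (1, 2, 3), (0, 1, 4), (5, 6, 1)
lam, mu, nu = (1, 0, 1), (0, 2, 1), (1, 1, 0)

rho_true = (3, -2, 5)
delta = plus(scale(direct(lam, rho_true), alpha),
             scale(direct(mu, rho_true), beta),
             scale(direct(nu, rho_true), gamma))

rho = solve_first_degree(alpha, beta, gamma, lam, mu, nu, delta)
print(rho == rho_true)
```

The method presupposes that neither $\alpha$, $\beta$, $\gamma$ nor $\lambda$, $\mu$, $\nu$ are parallel to one plane; otherwise the reciprocals do not exist and the sketch raises an error.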




It results from the principle stated in No. 35, that any vector 
equation of the first degree with respect to $\rho$ may be reduced to the 
form 

$$\delta = \alpha(\lambda.\rho) + \beta(\mu.\rho) + \gamma(\nu.\rho) + a\rho + \epsilon\times\rho.$$

But 

$$a\rho = a\lambda'(\lambda.\rho) + a\mu'(\mu.\rho) + a\nu'(\nu.\rho),$$

and 

$$\epsilon\times\rho = \epsilon\times\lambda'\,(\lambda.\rho) + \epsilon\times\mu'\,(\mu.\rho) + \epsilon\times\nu'\,(\nu.\rho);$$

where $\lambda'$, $\mu'$, $\nu'$ represent, as before, the reciprocals of $\lambda$, $\mu$, $\nu$. By 
substitution of these values the equation is reduced to the form of 
equation (1), which may therefore be regarded as the most general 
form of a vector equation of the first degree with respect to $\rho$. 

41. Relations between two normal systems of unit vectors.—If 
i,j, k, and 7’, 7’, k’ are two normal systems of unit vectors, we have 


VH=OV +9. yj +b.0)K, 
SHOJI MAGI 9 HEI) (1) 
kK =(i.k i +(j.K)j+(k.RYK, 

and t=. WHA) 4+GEDEK, 
J=GOWCAGDI)I AG EVE, | (2) 
h=(k.v)v +(k.j’)f H(KRYE. 


(See equation (8) of No. 38.) 

The nine coefficients in these equations are evidently the cosines 
of the nine angles made by a vector of one system with a vector of 
the other system. The principal relations of these cosines are easily 
deduced. By direct multiplication of each of the preceding equations 
with itself, we obtain six equations of the type 

APEGAPERAPEL (3) 
By direct multiplication of equations (1) with each other, and of 
equations (2) with each other, we obtain six of the type 
COV OP)AGH)GIVAGE.1) (hj) =0. (4) 
By skew multiplication of equations (1) with each other, we obtain 
three of the type 
BAG OVE IVA EM GINA AVO7)-C.AUVEING 
HGEYGIV-G-VV GF) h 
Comparing these three equations with the original three, we obtain 
nine of the type 
0. = (9.0) (ke.9')—(h.0V(G.9'). (5) 
Finally, if we equate the scalar product of the three right hand 
members of (1) with that of the three left hand members, we obtain 
0) G- IVE AOIVG KETV +ORYG.UM(E.§) 
hAVGIVORV— FIV IMG ERVG.4)U.7)=1. (6) 

Equations (1) and (2) (if the expressions in the parentheses are
supposed replaced by numerical values) represent the linear relations
which subsist between one vector of one system and the three vectors
of the other system. If we desire to express the similar relations
which subsist between two vectors of one system and two of the other,
we may take the skew products of equations (1) with equations (2),
after transposing all terms in the latter. This will afford nine
equations of the type

$$(i\cdot j')\,k' - (i\cdot k')\,j' = (k\cdot i')\,j - (j\cdot i')\,k. \tag{7}$$
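The relations of No. 41 among the nine direction cosines can be checked numerically. The following is a minimal sketch, not part of the original text: the function names and the two rotation angles are illustrative assumptions, and the primed system is simply taken to be a rotated copy of $i$, $j$, $k$.

```python
import math


def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[r][m] * b[m][c] for m in range(3)) for c in range(3)]
            for r in range(3)]


def direction_cosines(angle_z=0.7, angle_x=-1.1):
    """Return C with C[a][b] = (a-th of i, j, k) . (b-th of i', j', k').

    The primed system is a rotated copy of the unprimed one; the two
    angles are arbitrary illustrative values (an assumption).
    """
    cz, sz = math.cos(angle_z), math.sin(angle_z)
    cx, sx = math.cos(angle_x), math.sin(angle_x)
    rot_z = [[cz, -sz, 0.0], [sz, cz, 0.0], [0.0, 0.0, 1.0]]
    rot_x = [[1.0, 0.0, 0.0], [0.0, cx, -sx], [0.0, sx, cx]]
    # Column b of the product holds the components of the b-th primed vector.
    return matmul(rot_z, rot_x)


def residuals(c):
    """Residuals of one relation each of types (3), (4), (5), (6)."""
    (ii, ij, ik), (ji, jj, jk), (ki, kj, kk) = c  # e.g. ij means i . j'
    r3 = ii**2 + ji**2 + ki**2 - 1.0
    r4 = ii * ij + ji * jj + ki * kj
    r5 = ik - (ji * kj - ki * jj)
    r6 = (ii * jj * kk + ij * jk * ki + ik * ji * kj
          - ki * jj * ik - kj * jk * ii - kk * ji * ij) - 1.0
    return r3, r4, r5, r6
```

Calling `residuals(direction_cosines())` should return four numbers that vanish to rounding error. Relation (5) and the value $+1$ in (6) rely on both systems being right-handed, which a pure rotation guarantees.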


We may divide an equation by an indeterminate direct factor. [MS. note by author. ] 




CHAPTER II. 


CONCERNING THE DIFFERENTIAL AND INTEGRAL CALCULUS OF VECTORS. 


42. Differentials of vectors.—The differential of a vector is the
geometrical difference of two values of that vector which differ
infinitely little. It is itself a vector, and may make any angle with
the vector differentiated. It is expressed by the same sign ($d$) as the
differentials of ordinary analysis.

With reference to any fixed axes, the components of the differential
of a vector are manifestly equal to the differentials of the components
of the vector, i.e., if $\alpha$, $\beta$, and $\gamma$ are fixed unit vectors, and

$$\rho = x\alpha + y\beta + z\gamma,$$
$$d\rho = dx\,\alpha + dy\,\beta + dz\,\gamma.$$


43. Differential of a function of several variables.—The differential 
of a vector or scalar function of any number of vector or scalar 
variables is evidently the sum (geometrical or algebraic, according as 
the function is vector or scalar) of the differentials of the function 
due to the separate variation of the several variables. 

44. Differential of a product.—The differential of a product of any
kind due to the variation of a single factor is obtained by prefixing 
the sign of differentiation to that factor in the product. This is 
evidently true of differentials, since it will hold true even of finite 
differences. 

45. From these principles we obtain the following identical 
equations : 


$$d(\alpha + \beta) = d\alpha + d\beta, \tag{1}$$
$$d(n\alpha) = dn\,\alpha + n\,d\alpha, \tag{2}$$
$$d(\alpha\cdot\beta) = d\alpha\cdot\beta + \alpha\cdot d\beta, \tag{3}$$
$$d[\alpha\times\beta] = d\alpha\times\beta + \alpha\times d\beta, \tag{4}$$
$$d(\alpha\cdot\beta\times\gamma) = d\alpha\cdot\beta\times\gamma + \alpha\cdot d\beta\times\gamma + \alpha\cdot\beta\times d\gamma, \tag{5}$$
$$d[(\alpha\cdot\beta)\,\gamma] = (d\alpha\cdot\beta)\,\gamma + (\alpha\cdot d\beta)\,\gamma + (\alpha\cdot\beta)\,d\gamma. \tag{6}$$

46. Differential coefficient with respect to a scalar.—The quotient
obtained by dividing the differential of a vector due to the variation
of any scalar of which it is a function by the differential of that
scalar is called the differential coefficient of the vector with respect
to the scalar, and is indicated in the same manner as the differential
coefficients of ordinary analysis.

If we suppose the quantities occurring in the six equations of the
last section to be functions of a scalar $t$, we may substitute $\dfrac{d}{dt}$ for $d$ in
those equations since this is only to divide all terms by the scalar $dt$.
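As an illustration of substituting $d/dt$ for $d$ in equation (4) of No. 45, the sketch below compares a finite-difference estimate of $\frac{d}{dt}[\alpha\times\beta]$ with $\frac{d\alpha}{dt}\times\beta + \alpha\times\frac{d\beta}{dt}$. The two vector functions `alpha` and `beta`, the step `h`, and all function names are assumptions chosen only for the example.

```python
import math


def cross(a, b):
    """Skew (vector) product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def alpha(t):
    # Arbitrary smooth vector function of the scalar t (illustrative).
    return (math.cos(t), t * t, 1.0)


def beta(t):
    # A second arbitrary smooth vector function of t (illustrative).
    return (t, math.sin(t), math.exp(t))


def ddt(f, t, h=1e-5):
    """Central-difference estimate of the differential coefficient df/dt."""
    plus, minus = f(t + h), f(t - h)
    return tuple((p - m) / (2 * h) for p, m in zip(plus, minus))


def product_rule_gap(t):
    """Largest component of d/dt[a x b] - (da/dt x b + a x db/dt)."""
    lhs = ddt(lambda s: cross(alpha(s), beta(s)), t)
    first = cross(ddt(alpha, t), beta(t))
    second = cross(alpha(t), ddt(beta, t))
    return max(abs(l - (x + y)) for l, x, y in zip(lhs, first, second))
```

For moderate values of `t` the gap returned by `product_rule_gap` should be at the level of the finite-difference error, far below the size of the vectors themselves.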

47. Successive differentiations.—The differential coefficient of a
vector with respect to a scalar is of course a finite vector, of which
we may take the differential, or the differential coefficient with
respect to the same or any other scalar. We thus obtain differential
coefficients of the higher orders, which are indicated as in the scalar
calculus.

A few examples will serve for illustration.

If $\rho$ is the vector drawn from a fixed origin to a moving point at
any time $t$, $\dfrac{d\rho}{dt}$ will be the vector representing the velocity of the
point, and $\dfrac{d^2\rho}{dt^2}$ the vector representing its acceleration.

If $\rho$ is the vector drawn from a fixed origin to any point on a
curve, and $s$ the distance of that point measured on the curve from
any fixed point, $\dfrac{d\rho}{ds}$ is a unit vector, tangent to the curve and having
the direction in which $s$ increases; $\dfrac{d^2\rho}{ds^2}$ is a vector directed from a
point on the curve to the center of curvature, and equal to the
curvature; $\dfrac{d\rho}{ds}\times\dfrac{d^2\rho}{ds^2}$ is the normal to the osculating plane, directed to
the side on which the curve appears described counter-clockwise
about the center of curvature, and equal to the curvature. The
tortuosity (or rate of rotation of the osculating plane, considered as
positive when the rotation appears counter-clockwise as seen from
the direction in which $s$ increases) is represented by

$$\frac{\dfrac{d\rho}{ds}\cdot\dfrac{d^2\rho}{ds^2}\times\dfrac{d^3\rho}{ds^3}}{\dfrac{d^2\rho}{ds^2}\cdot\dfrac{d^2\rho}{ds^2}}.$$
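The expressions of No. 47 for the curvature and the tortuosity can be tried on a circular helix, for which both are known in closed form. This is a sketch under stated assumptions: the helix, its parameters `a` and `b`, and the function names are not from the text. With $c=\sqrt{a^2+b^2}$ the parameter $s$ is the arc length, and the expected values are $a/c^2$ and $b/c^2$.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def helix_derivatives(s, a=2.0, b=1.0):
    """First three derivatives with respect to s of the helix
    rho(s) = (a cos(s/c), a sin(s/c), b s/c), where c = sqrt(a^2 + b^2)."""
    c = math.hypot(a, b)
    co, si = math.cos(s / c), math.sin(s / c)
    d1 = (-a / c * si, a / c * co, b / c)
    d2 = (-a / c**2 * co, -a / c**2 * si, 0.0)
    d3 = (a / c**3 * si, -a / c**3 * co, 0.0)
    return d1, d2, d3


def curvature_and_tortuosity(d1, d2, d3):
    """Curvature |rho''| and tortuosity rho'.(rho'' x rho''') / (rho''.rho'')."""
    curvature = math.sqrt(dot(d2, d2))
    tortuosity = dot(d1, cross(d2, d3)) / dot(d2, d2)
    return curvature, tortuosity
```

With the default `a = 2`, `b = 1` we have $c^2 = 5$, so the function should return a curvature of 0.4 and a tortuosity of 0.2 at every point of the helix.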

48. Integration of an equation between differentials.—If $t$ and $u$
are two single-valued continuous scalar functions of any number of
scalar or vector variables, and

$$dt = du,$$
then
$$t = u + a,$$
where $a$ is a scalar constant.

Or, if $\tau$ and $\omega$ are two single-valued continuous vector functions
of any number of scalar or vector variables, and

$$d\tau = d\omega,$$
then
$$\tau = \omega + \alpha,$$
where $\alpha$ is a vector constant.

When the above hypotheses are not satisfied in general, but will be 
satisfied if the variations of the independent variables are confined 
within certain limits, then the conclusions will hold within those limits, 
provided that we can pass by continuous variation of the independent 
variables from any values within the limits to any other values 
within them, without transgressing the limits. 

49. So far, it will be observed, all operations have been entirely 
analogous to those of the ordinary calculus. 


Functions of Position in Space. 


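50. Def.—If $u$ is any scalar function of position in space (i.e., any
scalar quantity having continuously varying values in space), $\nabla u$ is
the vector function of position in space which has everywhere the
direction of the most rapid increase of $u$, and a magnitude equal to
the rate of that increase per unit of length. $\nabla u$ may be called the
derivative of $u$, and $u$, the primitive of $\nabla u$.

We may also take any one of the Nos. 51, 52, 53 for the definition
of $\nabla u$.

51. If $\rho$ is the vector defining the position of a point in space,

$$du = \nabla u\cdot d\rho.$$

52.
$$\nabla u = i\,\frac{du}{dx} + j\,\frac{du}{dy} + k\,\frac{du}{dz}.$$

53.
$$\frac{du}{dx} = i\cdot\nabla u,\qquad \frac{du}{dy} = j\cdot\nabla u,\qquad \frac{du}{dz} = k\cdot\nabla u.$$

54. Def.—If $\omega$ is a vector having continuously varying values in space,

$$\nabla\cdot\omega = i\cdot\frac{d\omega}{dx} + j\cdot\frac{d\omega}{dy} + k\cdot\frac{d\omega}{dz}, \tag{1}$$
and
$$\nabla\times\omega = i\times\frac{d\omega}{dx} + j\times\frac{d\omega}{dy} + k\times\frac{d\omega}{dz}. \tag{2}$$

$\nabla\cdot\omega$ is called the divergence of $\omega$ and $\nabla\times\omega$ its curl.
If we set
$$\omega = Xi + Yj + Zk,$$
we obtain by substitution the equations

$$\nabla\cdot\omega = \frac{dX}{dx} + \frac{dY}{dy} + \frac{dZ}{dz},$$
and
$$\nabla\times\omega = i\left(\frac{dZ}{dy} - \frac{dY}{dz}\right) + j\left(\frac{dX}{dz} - \frac{dZ}{dx}\right) + k\left(\frac{dY}{dx} - \frac{dX}{dy}\right),$$

which may also be regarded as defining $\nabla\cdot\omega$ and $\nabla\times\omega$.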
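Definitions (1) and (2) of No. 54 translate directly into a numerical recipe: estimate $d\omega/dx$, $d\omega/dy$, $d\omega/dz$ by central differences and combine them as the definitions prescribe. The sketch below is only an illustration; the sample field $\omega = x^2y\,i + yz\,j + xz^2\,k$, the step `h`, and the function names are assumptions.

```python
def field(p):
    """Sample vector function of position: (X, Y, Z) = (x^2 y, y z, x z^2)."""
    x, y, z = p
    return (x * x * y, y * z, x * z * z)


def partial(f, p, axis, h=1e-5):
    """Central-difference estimate of the vector d(omega)/d(axis) at p."""
    hi, lo = list(p), list(p)
    hi[axis] += h
    lo[axis] -= h
    return tuple((a - b) / (2 * h) for a, b in zip(f(hi), f(lo)))


def divergence(f, p):
    """i . dw/dx + j . dw/dy + k . dw/dz, as in equation (1) of No. 54."""
    return sum(partial(f, p, axis)[axis] for axis in range(3))


def curl(f, p):
    """i x dw/dx + j x dw/dy + k x dw/dz, as in equation (2) of No. 54."""
    dx, dy, dz = (partial(f, p, axis) for axis in range(3))
    return (dy[2] - dz[1], dz[0] - dx[2], dx[1] - dy[0])
```

For this field the Cartesian formulas give $\nabla\cdot\omega = 2xy + z + 2xz$ and $\nabla\times\omega = -y\,i - z^2 j - x^2 k$, so at the point $(1, 2, 3)$ the functions should return values close to 13 and $(-2, -9, -1)$.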
55. Surface-integrals.—The integral $\iint\omega\cdot d\sigma$, in which $d\sigma$ represents
an element of some surface, is called the surface-integral of $\omega$ for that
surface. It is understood here and elsewhere, when a vector is said 
to represent a plane surface (or an element of surface which may be 
regarded as plane), that the magnitude of the vector represents the 
area of the surface, and that the direction of the vector represents 
that of the normal drawn toward the positive side of the surface. 
When the surface is defined as the boundary of a certain space, the 
outside of the surface is regarded as positive. 

The surface-integral of any given space (i.e., the surface-integral
of the surface bounding that space) is evidently equal to the sum of 
the surface-integrals of all the parts into which the original space 
may be divided. For the integrals relating to the surfaces dividing 
the parts will evidently cancel in such a sum. 

The surface-integral of $\omega$ for a closed surface bounding a space $dv$
infinitely small in all its dimensions is

$$\nabla\cdot\omega\,dv.$$

This follows immediately from the definition of $\nabla\cdot\omega$, when the space
is a parallelopiped bounded by planes perpendicular to $i$, $j$, $k$. In
other cases, we may imagine the space (or rather a space nearly
coincident with the given space and of the same volume $dv$) to be
divided up into such parallelopipeds. The surface-integral for the
space made up of the parallelopipeds will be the sum of the
surface-integrals of all the parallelopipeds, and will therefore be expressed by
$\nabla\cdot\omega\,dv$. The surface-integral of the original space will have sensibly
the same value, and will therefore be represented by the same
formula. It follows that the value of $\nabla\cdot\omega$ does not depend upon the
system of unit vectors employed in its definition.

It is possible to attribute such a physical signification to the
quantities concerned in the above proposition, as shall make it evident
almost without demonstration. Let us suppose $\omega$ to represent a flux
of any substance. The rate of decrease of the density of that
substance at any point will be obtained by dividing the surface-integral
of the flux for any infinitely small closed surface about the point by
the volume enclosed. This quotient must therefore be independent of
the form of the surface. We may define $\nabla\cdot\omega$ as representing that
quotient, and then obtain equation (1) of No. 54 by applying the
general principle to the case of the rectangular parallelopiped.

56. Skew surface-integrals.—The integral $\iint d\sigma\times\omega$ may be called
the skew surface-integral of $\omega$. It is evidently a vector. For a
closed surface bounding a space $dv$ infinitely small in all dimensions
this integral reduces to $\nabla\times\omega\,dv$, as is easily shown by reasoning like
that of No. 55.

57. Integration.—If $dv$ represents an element of any space, and $d\sigma$
an element of the bounding surface,

$$\iiint\nabla\cdot\omega\,dv = \iint\omega\cdot d\sigma.$$


For the first member of this equation represents the sum of the 
surface-integrals of all the elements of the given space. We may 
regard this principle as affording a means of integration, since we 
may use it to reduce a triple integral (of a certain form) to a double 
integral. 

The principle may also be expressed as follows: 

The surface-integral of any vector function of position in space for 
a closed surface is equal to the volume-integral of the divergence of 
that function for the space enclosed. 
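The principle of No. 57 can be illustrated on the unit cube by estimating both integrals with the midpoint rule. This is a rough numerical sketch, not a proof: the field $\omega = x^2y\,i + y^2z\,j + z^2x\,k$, the grid size `n`, and the function names are assumptions made for the example.

```python
def field(p):
    """Sample vector function of position: (x^2 y, y^2 z, z^2 x)."""
    x, y, z = p
    return (x * x * y, y * y * z, z * z * x)


def divergence(f, p, h=1e-5):
    """Central-difference estimate of the divergence of f at p."""
    total = 0.0
    for axis in range(3):
        hi, lo = list(p), list(p)
        hi[axis] += h
        lo[axis] -= h
        total += (f(hi)[axis] - f(lo)[axis]) / (2 * h)
    return total


def volume_integral(f, n=10):
    """Midpoint-rule estimate of the integral of div f over the unit cube."""
    step = 1.0 / n
    mids = [(m + 0.5) * step for m in range(n)]
    return sum(divergence(f, (x, y, z))
               for x in mids for y in mids for z in mids) * step**3


def surface_integral(f, n=10):
    """Midpoint-rule estimate of the outward flux of f through the six faces."""
    step = 1.0 / n
    mids = [(m + 0.5) * step for m in range(n)]
    total = 0.0
    for axis in range(3):
        for side, sign in ((0.0, -1.0), (1.0, 1.0)):  # outward normal = sign * axis
            for u in mids:
                for v in mids:
                    p = [u, v]
                    p.insert(axis, side)  # place the fixed coordinate of the face
                    total += sign * f(p)[axis] * step**2
    return total
```

For this field both `volume_integral(field)` and `surface_integral(field)` should come out close to 1.5, the outside of the cube being taken as the positive side of each face as in No. 55.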

58. Line-integrals.—The integral $\int\omega\cdot d\rho$, in which $d\rho$ denotes the
element of a line, is called the line-integral of $\omega$ for that line. It is
implied that one of the directions of the line is distinguished as 
positive. When the line is regarded as bounding a surface, that side 
of the surface will always be regarded as positive, on which the 
surface appears to be circumscribed counter-clockwise. 

59. Integration.—From No. 51 we obtain directly

$$\int\nabla u\cdot d\rho = u'' - u',$$
where the single and double accents distinguish the values relating
to the beginning and end of the line.

In other words,—The line-integral of the derivative of any
(continuous and single-valued) scalar function of position in space is equal
to the difference of the values of the function at the extremities of
the line. For a closed line the integral vanishes.
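A small numerical illustration of No. 59 follows. The scalar function $u = x^2y + z$, the path $\rho(t) = (t, t^2, t^3)$ for $0 \le t \le 1$, and the function names are assumptions chosen for the example; the line-integral of $\nabla u$ along the path is estimated by the midpoint rule and compared with $u'' - u'$.

```python
def u(p):
    """Sample scalar function of position."""
    x, y, z = p
    return x * x * y + z


def grad_u(p):
    """Derivative of u, written out by hand for the sample function."""
    x, y, z = p
    return (2 * x * y, x * x, 1.0)


def path(t):
    """Position vector rho(t) along an illustrative curve."""
    return (t, t * t, t**3)


def path_velocity(t):
    """d(rho)/dt for the curve above."""
    return (1.0, 2 * t, 3 * t * t)


def line_integral(n=2000):
    """Midpoint-rule estimate of the line-integral of grad u from t=0 to t=1."""
    step = 1.0 / n
    total = 0.0
    for m in range(n):
        t = (m + 0.5) * step
        g, v = grad_u(path(t)), path_velocity(t)
        total += sum(a * b for a, b in zip(g, v)) * step
    return total
```

Here $u'' - u' = u(1,1,1) - u(0,0,0) = 2$, and `line_integral()` should agree with that value to within the error of the midpoint rule.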


60. Integration.—The following principle may be used to reduce
double integrals of a certain form to simple integrals.
If $d\sigma$ represents an element of any surface, and $d\rho$ an element of
the bounding line,

$$\iint\nabla\times\omega\cdot d\sigma = \int\omega\cdot d\rho.$$

In other words,—The line-integral of any vector function of position
in space for a closed line is equal to the surface-integral of the curl of
that function for any surface bounded by the line.

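To prove this principle, we will consider the variation of the
line-integral which is due to a variation in the closed line for which
the integral is taken. We have, in the first place,

$$\delta\int\omega\cdot d\rho = \int\delta\omega\cdot d\rho + \int\omega\cdot\delta\,d\rho.$$
But
$$\omega\cdot\delta\,d\rho = d(\omega\cdot\delta\rho) - d\omega\cdot\delta\rho.$$
Therefore, since $\int d(\omega\cdot\delta\rho) = 0$ for a closed line,

$$\delta\int\omega\cdot d\rho = \int\delta\omega\cdot d\rho - \int d\omega\cdot\delta\rho.$$
Now
$$\delta\omega = \sum\left[\frac{d\omega}{dx}\,\delta x\right] = \sum\left[\frac{d\omega}{dx}\,(i\cdot\delta\rho)\right],$$
and
$$d\omega = \sum\left[\frac{d\omega}{dx}\,dx\right] = \sum\left[\frac{d\omega}{dx}\,(i\cdot d\rho)\right],$$

where the summation relates to the coordinate axes and connected
quantities. Substituting these values in the preceding equation,

$$\delta\int\omega\cdot d\rho = \int\sum\left\{(i\cdot\delta\rho)\left(\frac{d\omega}{dx}\cdot d\rho\right) - (i\cdot d\rho)\left(\frac{d\omega}{dx}\cdot\delta\rho\right)\right\},$$
or by No. 30,
$$\delta\int\omega\cdot d\rho = \int\sum\left[i\times\frac{d\omega}{dx}\right]\cdot[\delta\rho\times d\rho] = \int\nabla\times\omega\cdot[\delta\rho\times d\rho].$$

But $\delta\rho\times d\rho$ represents an element of the surface generated by the
motion of the element $d\rho$, and the last member of the equation is
the surface-integral of $\nabla\times\omega$ for the infinitesimal surface generated
by the motion of the whole line. Hence, if we conceive of a closed
curve passing gradually from an infinitesimal loop to any finite form,
the differential of the line-integral of $\omega$ for that curve will be equal
to the differential of the surface-integral of $\nabla\times\omega$ for the surface
generated: therefore, since both integrals commence with the value
zero, they must always be equal to each other. Such a mode of
generation will evidently apply to any surface closing any loop.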
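The equality of No. 60 can be tried numerically on the unit circle in the plane of $i$ and $j$, bounding the unit disk. This is a sketch under assumptions: the field $\omega = -y^3 i + x^3 j$, its curl $(3x^2 + 3y^2)\,k$ written out by hand, the grid sizes, and the function names are all illustrative.

```python
import math


def omega(x, y):
    """In-plane components of the sample field (-y^3, x^3, 0)."""
    return (-y**3, x**3)


def curl_k(x, y):
    """k-component of the curl of the sample field: dY/dx - dX/dy."""
    return 3 * x * x + 3 * y * y


def line_integral(n=400):
    """Line-integral of omega round the unit circle, counter-clockwise."""
    step = 2 * math.pi / n
    total = 0.0
    for m in range(n):
        th = (m + 0.5) * step
        x, y = math.cos(th), math.sin(th)
        wx, wy = omega(x, y)
        # d(rho) = (-sin th, cos th) d(th) on the unit circle
        total += (wx * (-y) + wy * x) * step
    return total


def surface_integral(nr=400, nt=64):
    """Surface-integral of the curl over the unit disk, in polar coordinates."""
    dr, dth = 1.0 / nr, 2 * math.pi / nt
    total = 0.0
    for a in range(nr):
        r = (a + 0.5) * dr
        for b in range(nt):
            th = (b + 0.5) * dth
            total += curl_k(r * math.cos(th), r * math.sin(th)) * r * dr * dth
    return total
```

Both integrals should be close to $3\pi/2$. The counter-clockwise sense of the circle and the choice of $+k$ as the positive normal of the disk follow the convention of No. 58.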

61. The line-integral of $\omega$ for a closed line bounding a plane surface
$d\sigma$ infinitely small in all its dimensions is therefore

$$\nabla\times\omega\cdot d\sigma.$$

This principle affords a definition of $\nabla\times\omega$ which is independent
of any reference to coordinate axes. If we imagine a circle described
about a fixed point to vary its orientation while keeping the same
size, there will be a certain position of the circle for which the
line-integral of $\omega$ will be a maximum, unless the line-integral vanishes for
all positions of the circle. The axis of the circle in this position,
drawn toward the side on which a positive motion in the circle
appears counter-clockwise, gives the direction of $\nabla\times\omega$, and the
quotient of the integral divided by the area of the circle gives the
magnitude of $\nabla\times\omega$.


$\nabla$, $\nabla\cdot$, and $\nabla\times$ applied to Functions of Functions of Position.


62. A constant scalar factor after $\nabla$, $\nabla\cdot$, or $\nabla\times$ may be placed
before the symbol.

63. If $f(u)$ denotes any scalar function of $u$, and $f'(u)$ the derived
function,
$$\nabla f(u) = f'(u)\,\nabla u.$$

64. If $u$ or $\omega$ is a function of several scalar or vector variables
which are themselves functions of the position of a single point, the
value of $\nabla u$ or $\nabla\cdot\omega$ or $\nabla\times\omega$ will be equal to the sum of the values
obtained by making successively all but each one of these variables
constant.

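65. By the use of this principle we easily derive the following
identical equations:

$$\nabla(t + u) = \nabla t + \nabla u. \tag{1}$$
$$\nabla\cdot[\tau + \omega] = \nabla\cdot\tau + \nabla\cdot\omega.\qquad \nabla\times[\tau + \omega] = \nabla\times\tau + \nabla\times\omega. \tag{2}$$
$$\nabla(tu) = u\,\nabla t + t\,\nabla u. \tag{3}$$
$$\nabla\cdot(u\omega) = \omega\cdot\nabla u + u\,\nabla\cdot\omega. \tag{4}$$
$$\nabla\times[u\omega] = u\,\nabla\times\omega - \omega\times\nabla u. \tag{5}$$
$$\nabla\cdot[\tau\times\omega] = \omega\cdot\nabla\times\tau - \tau\cdot\nabla\times\omega. \tag{6}$$

The student will observe an analogy between these equations and
the formulæ of multiplication. (In the last four equations the
analogy appears most distinctly when we regard all the factors but
one as constant.) Some of the more curious features of this analogy
are due to the fact that the $\nabla$ contains implicitly the vectors $i$, $j$
and $k$, which are to be multiplied into the following quantities.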


Combinations of the Operators $\nabla$, $\nabla\cdot$, and $\nabla\times$.


66. If $u$ is any scalar function of position in space,
$$\nabla\times\nabla u = 0,$$
as may be derived directly from the definitions of these operators.

67. Conversely, if $\omega$ is such a vector function of position in space
that
$$\nabla\times\omega = 0,$$
$\omega$ is the derivative of a scalar function of position in space. This will
appear from the following considerations:

The line-integral $\int\omega\cdot d\rho$ will vanish for any closed line, since it may
be expressed as the surface-integral of $\nabla\times\omega$. (No. 60.) The
line-integral taken from one given point $P'$ to another given point $P''$ is
independent of the line between the points for which the integral
is taken. (For, if two lines joining the same points gave different
values, by reversing one we should obtain a closed line for which the
integral would not vanish.) If we set $u$ equal to this line-integral,
supposing $P''$ to be variable and $P'$ to be constant in position, $u$ will
be a scalar function of the position of the point $P''$, satisfying the
condition $du = \omega\cdot d\rho$, or, by No. 51, $\nabla u = \omega$. There will evidently be
an infinite number of functions satisfying this condition, which will
differ from one another by constant quantities.

If the region for which $\nabla\times\omega = 0$ is unlimited, these functions will
be single-valued. If the region is limited, but acyclic,* the functions
will still be single-valued and satisfy the condition $\nabla u = \omega$ within the
same region. If the region is cyclic, we may determine functions
satisfying the condition $\nabla u = \omega$ within the region, but they will not
necessarily be single-valued.

68. If $\omega$ is any vector function of position in space, $\nabla\cdot\nabla\times\omega = 0$.
This may be deduced directly from the definitions of No. 54.

The converse of this proposition will be proved hereafter.

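69. If $u$ is any scalar function of position in space, we have by
Nos. 52 and 54

$$\nabla\cdot\nabla u = \left(\frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}\right)u.$$

70. Def.—If $\omega$ is any vector function of position in space, we may
define $\nabla\cdot\nabla\omega$ by the equation

$$\nabla\cdot\nabla\omega = \left(\frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}\right)\omega,$$

the expression $\nabla\cdot\nabla$ being regarded, for the present at least, as a single
operator when applied to a vector. (It will be remembered that no
meaning has been attributed to $\nabla$ before a vector.) It should be
noticed that if
$$\omega = iX + jY + kZ,$$
$$\nabla\cdot\nabla\omega = i\,\nabla\cdot\nabla X + j\,\nabla\cdot\nabla Y + k\,\nabla\cdot\nabla Z,$$
that is, the operator $\nabla\cdot\nabla$ applied to a vector affects separately its
scalar components.

71. From the above definition with those of Nos. 52 and 54 we
may easily obtain
$$\nabla\cdot\nabla\omega = \nabla\nabla\cdot\omega - \nabla\times\nabla\times\omega.$$

The effect of the operator $\nabla\cdot\nabla$ is therefore independent of the
directions of the axes used in its definition.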


*If every closed line within a given region can contract to a single point without
breaking its continuity, or passing out of the region, the region is called acyclic,
otherwise cyclic.

A cyclic region may be made acyclic by diaphragms, which must then be regarded as
forming part of the surface bounding the region, each diaphragm contributing its own
area twice to that surface. This process may be used to reduce many-valued functions
of position in space, having single-valued derivatives, to single-valued functions.

When functions are mentioned or implied in the notation, the reader will always
understand single-valued functions, unless the contrary is distinctly intimated, or the
case is one in which the distinction is obviously immaterial. Diaphragms may be
applied to bring functions naturally many-valued under the application of some of the
following theorems, as Nos. 74 ff.

72. The expression $-\tfrac{1}{6}a^2\,\nabla\cdot\nabla u$, where $a$ is any infinitesimal scalar,
evidently represents the excess of the value of the scalar function $u$
at the point considered above the average of its values at six points
at the following vector distances: $ai$, $-ai$, $aj$, $-aj$, $ak$, $-ak$. Since
the directions of $i$, $j$, and $k$ are immaterial (provided that they are
at right angles to each other), the excess of the value of $u$ at the
central point above its average value in a spherical surface of radius
$a$ constructed about that point as the center will be represented by
the same expression, $-\tfrac{1}{6}a^2\,\nabla\cdot\nabla u$.

Precisely the same is true of a vector function, if it is understood
that the additions and subtractions implied in the terms average and
excess are geometrical additions and subtractions.

Maxwell has called $-\nabla\cdot\nabla u$ the concentration of $u$, whether $u$ is
scalar or vector. We may call $\nabla\cdot\nabla u$ (or $\nabla\cdot\nabla\omega$), which is proportioned
to the excess of the average value of the function in an infinitesimal
spherical surface above the value at the center, the dispersion of
$u$ (or $\omega$).
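The six-point statement of No. 72 is easy to try numerically. In the sketch below the scalar function $u = x^2 + 2y^2 + 3z^2 + xyz$ (for which $\nabla\cdot\nabla u = 12$ everywhere), the point, the distance `a`, and the function names are assumptions made for the example. Because this $u$ has no terms above the third degree, the relation holds exactly here and not merely for infinitesimal `a`.

```python
def u(p):
    """Sample scalar function of position; its Laplacian is 2 + 4 + 6 = 12."""
    x, y, z = p
    return x * x + 2 * y * y + 3 * z * z + x * y * z


LAPLACIAN_U = 12.0  # value of the operator applied to the sample u, worked by hand


def excess_over_six_point_average(f, p, a):
    """Value of f at p minus the average of f at p +/- a*i, p +/- a*j, p +/- a*k."""
    total = 0.0
    for axis in range(3):
        for sign in (1.0, -1.0):
            q = list(p)
            q[axis] += sign * a
            total += f(q)
    return f(p) - total / 6.0
```

With `p = (1, 2, 3)` and `a = 0.1`, the excess should equal $-\tfrac{1}{6}a^2 \cdot 12 = -0.02$ up to rounding.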


Transformation of Definite Integrals. 


73. From the equations of No. 65, with the principles of integration
of Nos. 57, 59, and 60, we may deduce various transformations of
definite integrals, which are entirely analogous to those known in the
scalar calculus under the name of integration by parts. The following
formulæ (like those of Nos. 57, 59, and 60) are written for the
case of continuous values of the quantities (scalar and vector) to which
the signs $\nabla$, $\nabla\cdot$, and $\nabla\times$ are applied. It is left to the student to
complete the formulæ for cases of discontinuity in these values. The
manner in which this is to be done may in each case be inferred from
the nature of the formula itself. The most important discontinuities
of scalars are those which occur at surfaces: in the case of vectors
discontinuities at surfaces, at lines, and at points, should be considered.

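74. From equation (3) we obtain

$$\int\nabla(tu)\cdot d\rho = t''u'' - t'u' = \int u\,\nabla t\cdot d\rho + \int t\,\nabla u\cdot d\rho,$$
where the accents distinguish the quantities relating to the limits of
the line-integrals. We are thus able to reduce a line-integral of the
form $\int u\,\nabla t\cdot d\rho$ to the form $-\int t\,\nabla u\cdot d\rho$ with quantities free from the
sign of integration.

75. From equation (5) we obtain

$$\iint\nabla\times(u\omega)\cdot d\sigma = \int u\,\omega\cdot d\rho = \iint u\,\nabla\times\omega\cdot d\sigma - \iint\omega\times\nabla u\cdot d\sigma,$$

where, as elsewhere in these equations, the line-integral relates to the
boundary of the surface-integral.
From this, by substitution of $\nabla t$ for $\omega$, we may derive as a
particular case

$$\iint\nabla u\times\nabla t\cdot d\sigma = \int u\,\nabla t\cdot d\rho = -\int t\,\nabla u\cdot d\rho.$$

76. From equation (4) we obtain

$$\iiint\nabla\cdot[u\omega]\,dv = \iint u\,\omega\cdot d\sigma = \iiint\omega\cdot\nabla u\,dv + \iiint u\,\nabla\cdot\omega\,dv,$$

where, as elsewhere in these equations, the surface-integral relates to
the boundary of the volume-integrals.
From this, by substitution of $\nabla t$ for $\omega$, we derive as a particular case

$$\iiint\nabla t\cdot\nabla u\,dv = \iint u\,\nabla t\cdot d\sigma - \iiint u\,\nabla\cdot\nabla t\,dv = \iint t\,\nabla u\cdot d\sigma - \iiint t\,\nabla\cdot\nabla u\,dv,$$

which is Green's Theorem. The substitution of $s\nabla t$ for $\omega$ gives the
more general form of this theorem which is due to Thomson, viz.,

$$\begin{aligned}
\iiint s\,\nabla t\cdot\nabla u\,dv &= \iint u\,s\,\nabla t\cdot d\sigma - \iiint u\,\nabla\cdot[s\nabla t]\,dv\\
&= \iint t\,s\,\nabla u\cdot d\sigma - \iiint t\,\nabla\cdot[s\nabla u]\,dv.
\end{aligned}$$

77. From equation (6) we obtain

$$\iiint\nabla\cdot[\tau\times\omega]\,dv = \iint\tau\times\omega\cdot d\sigma = \iiint\omega\cdot\nabla\times\tau\,dv - \iiint\tau\cdot\nabla\times\omega\,dv.$$

A particular case is

$$\iiint\nabla u\cdot\nabla\times\omega\,dv = \iint\omega\times\nabla u\cdot d\sigma.$$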


Integration of Differential Equations. 


78. If throughout any continuous space (or in all space)
$$\nabla u = 0,$$
then throughout the same space
$$u = \text{constant}.$$

79. If throughout any continuous space (or in all space)
$$\nabla\cdot\nabla u = 0,$$
and in any finite part of that space, or in any finite surface in or
bounding it,
$$\nabla u = 0,$$
then throughout the whole space
$$\nabla u = 0,\quad\text{and}\quad u = \text{constant}.$$


This will appear from the following considerations:

If $\nabla u = 0$ in any finite part of the space, $u$ is constant in that part.
If $u$ is not constant throughout, let us imagine a sphere situated
principally in the part in which $u$ is constant, but projecting slightly
into a part in which $u$ has a greater value, or else into a part in
which $u$ has a less. The surface-integral of $\nabla u$ for the part of the
spherical surface in the region where $u$ is constant will have the
value zero: for the other part of the surface, the integral will be
either greater than zero, or less than zero. Therefore the whole
surface-integral for the spherical surface will not have the value zero,
which is required by the general condition, $\nabla\cdot\nabla u = 0$.

Again, if $\nabla u = 0$ only in a surface in or bounding the space in
which $\nabla\cdot\nabla u = 0$, $u$ will be constant in this surface, and the surface
will be contiguous to a region in which $\nabla\cdot\nabla u = 0$ and $u$ has a greater
value than in the surface, or else a less value than in the surface.
Let us imagine a sphere lying principally on the other side of the
surface, but projecting slightly into this region, and let us particularly
consider the surface-integral of $\nabla u$ for the small segment cut off by
the surface $\nabla u = 0$. The integral for that part of the surface of the
segment which consists of part of the surface $\nabla u = 0$ will have the
value zero, the integral for the spherical part will have a value either
greater than zero or else less than zero. Therefore the integral for
the whole surface of the segment cannot have the value zero, which
is demanded by the general condition, $\nabla\cdot\nabla u = 0$.

80. If throughout a certain space (which need not be continuous,
and which may extend to infinity)
$$\nabla\cdot\nabla u = 0,$$
and in all the bounding surfaces
$$u = \text{constant} = a,$$
and (in case the space extends to infinity) if at infinite distances
within the space $u = a$,—then throughout the space
$$\nabla u = 0,\quad\text{and}\quad u = a.$$

For, if anywhere in the interior of the space $\nabla u$ has a value
different from zero, we may find a point $P$ where such is the case,
and where $u$ has a value $b$ different from $a$,—to fix our ideas we will
say less. Imagine a surface enclosing all of the space in which $u < b$.
(This must be possible, since that part of the space does not reach to
infinity.) The surface-integral of $\nabla u$ for this surface has the value
zero in virtue of the general condition $\nabla\cdot\nabla u = 0$. But, from the
manner in which the surface is defined, no part of the integral can be
negative. Therefore no part of the integral can be positive, and the
supposition made with respect to the point $P$ is untenable. That the
supposition that $b > a$ is untenable may be shown in a similar
manner. Therefore the value of $u$ is constant.

This proposition may be generalized by substituting the condition
$\nabla\cdot[t\nabla u] = 0$ for $\nabla\cdot\nabla u = 0$, $t$ denoting any positive (or any negative)
scalar function of position in space. The conclusion would be the
same, and the demonstration similar.
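A discrete, two-dimensional analogue of No. 80 may help to fix ideas. It is only an illustration under stated assumptions, not the argument of the text: the five-point averaging condition stands in for $\nabla\cdot\nabla u = 0$ on a square grid, the boundary nodes are held at the constant `a`, and Gauss-Seidel relaxation is used to find the interior values. The grid size, the starting values, and the function names are arbitrary.

```python
import math


def relax_to_boundary_value(a=3.0, n=12, sweeps=2000):
    """Gauss-Seidel relaxation of the five-point discrete Laplace equation
    on an n-by-n grid whose boundary nodes are held at the value a."""
    grid = [[a] * n for _ in range(n)]
    for r in range(1, n - 1):
        for c in range(1, n - 1):
            grid[r][c] = a + math.sin(r * c)  # arbitrary non-constant start
    for _ in range(sweeps):
        for r in range(1, n - 1):
            for c in range(1, n - 1):
                # Each interior value becomes the average of its four neighbours.
                grid[r][c] = 0.25 * (grid[r - 1][c] + grid[r + 1][c]
                                     + grid[r][c - 1] + grid[r][c + 1])
    return grid


def largest_deviation(grid, a):
    """Greatest departure of any grid value from the boundary constant a."""
    return max(abs(v - a) for row in grid for v in row)
```

After relaxation the interior values should all agree with the boundary constant to rounding error, in keeping with the conclusion $u = a$ throughout the space.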

81. If throughout a certain space (which need not be continuous,
and which may extend to infinity)
$$\nabla\cdot\nabla u = 0,$$
and in all the bounding surfaces the normal component of $\nabla u$ vanishes,
and at infinite distances within the space (if such there are) $r^2\dfrac{du}{dr} = 0$,
where $r$ denotes the distance from a fixed origin, then throughout the
space
$$\nabla u = 0,$$
and in each continuous portion of the same
$$u = \text{constant}.$$

For, if anywhere in the space in question $\nabla u$ has a value different
from zero, let it have such a value at a point $P$, and let $u$ be there
equal to $b$. Imagine a spherical surface about the above-mentioned
origin as center, enclosing the point $P$, and with a radius $r$.
Consider that portion of the space to which the theorem relates which is
within the sphere and in which $u < b$. The surface-integral of $\nabla u$
for this space is equal to zero in virtue of the general condition
$\nabla\cdot\nabla u = 0$. That part of the integral (if any) which relates to a
portion of the spherical surface has a value numerically not greater
than $4\pi r^2\left(\dfrac{du}{dr}\right)'$, where $\left(\dfrac{du}{dr}\right)'$ denotes the greatest numerical value of
$\dfrac{du}{dr}$ in the portion of the spherical surface considered. Hence, the
value of this part of the surface-integral may be made less
(numerically) than any assignable quantity by giving to $r$ a sufficiently great
value. Hence, the other part of the surface-integral (viz., that relating
to the surface in which $u = b$, and to the boundary of the space to
which the theorem relates) may be given a value differing from zero
by less than any assignable quantity. But no part of the integral
relating to this surface can be negative. Therefore no part can be
positive, and the supposition relative to the point $P$ is untenable.

This proposition also may be generalized by substituting $\nabla\cdot[t\nabla u] = 0$
for $\nabla\cdot\nabla u = 0$, and $t\,r^2\dfrac{du}{dr} = 0$ for $r^2\dfrac{du}{dr} = 0$.

82. If throughout any continuous space (or in all space)
$$\nabla t = \nabla u,$$
then throughout the same space
$$t = u + \text{const.}$$

The truth of this and the three following theorems will be apparent if
we consider the difference $t - u$.

83. If throughout any continuous space (or in all space)
$$\nabla\cdot\nabla t = \nabla\cdot\nabla u,$$
and in any finite part of that space, or in any finite surface in or
bounding it,
$$\nabla t = \nabla u,$$
then throughout the whole space
$$\nabla t = \nabla u,\quad\text{and}\quad t = u + \text{const.}$$

84. If throughout a certain space (which need not be continuous,
and which may extend to infinity)
$$\nabla\cdot\nabla t = \nabla\cdot\nabla u,$$
and in all the bounding surfaces
$$t = u,$$
and at infinite distances within the space (if such there are)
$$t = u,$$
then throughout the space
$$t = u.$$

85. If throughout a certain space (which need not be continuous,
and which may extend to infinity)
$$\nabla\cdot\nabla t = \nabla\cdot\nabla u,$$
and in all the bounding surfaces the normal components of $\nabla t$ and $\nabla u$
are equal, and at infinite distances within the space (if such there are)
$r^2\left(\dfrac{dt}{dr} - \dfrac{du}{dr}\right) = 0$, where $r$ denotes the distance from some fixed origin,
—then throughout the space
$$\nabla t = \nabla u,$$
and in each continuous part of which the space consists
$$t - u = \text{constant}.$$

86. If throughout any continuous space (or in all space)
$$\nabla\times\tau = \nabla\times\omega\quad\text{and}\quad\nabla\cdot\tau = \nabla\cdot\omega,$$
and in any finite part of that space, or in any finite surface in or
bounding it,
$$\tau = \omega,$$
then throughout the whole space
$$\tau = \omega.$$

For, since $\nabla\times(\tau - \omega) = 0$, we may set $\nabla u = \tau - \omega$, making the space
acyclic (if necessary) by diaphragms. Then in the whole space $u$ is
single-valued and $\nabla\cdot\nabla u = 0$, and in a part of the space, or in a surface
in or bounding it, $\nabla u = 0$. Hence throughout the space $\nabla u = \tau - \omega = 0$.

87. If throughout an aperiphractic* space contained within finite
boundaries but not necessarily continuous
$$\nabla\times\tau = \nabla\times\omega\quad\text{and}\quad\nabla\cdot\tau = \nabla\cdot\omega,$$
and in all the bounding surfaces the tangential components of $\tau$ and
$\omega$ are equal, then throughout the space
$$\tau = \omega.$$

It is evidently sufficient to prove this proposition for a continuous
space. Setting $\nabla u = \tau - \omega$, we have $\nabla\cdot\nabla u = 0$ for the whole space,
and $u = \text{constant}$ for its boundary, which will be a single surface for a
continuous aperiphractic space. Hence throughout the space
$$\nabla u = \tau - \omega = 0.$$

*If a space encloses within itself another space, it is called periphractic, otherwise
aperiphractic.

88. If throughout an acyclic space contained within finite boundaries
but not necessarily continuous
$$\nabla\times\tau = \nabla\times\omega\quad\text{and}\quad\nabla\cdot\tau = \nabla\cdot\omega,$$
and in all the bounding surfaces the normal components of $\tau$ and $\omega$
are equal, then throughout the whole space
$$\tau = \omega.$$

Setting $\nabla u = \tau - \omega$, we have $\nabla\cdot\nabla u = 0$ throughout the space, and
the normal component of $\nabla u$ at the boundary equal to zero. Hence
throughout the whole space $\nabla u = \tau - \omega = 0$.

89. If throughout a certain space (which need not be continuous,
and which may extend to infinity)
$$\nabla\cdot\nabla\tau = \nabla\cdot\nabla\omega,$$
and in all the bounding surfaces
$$\tau = \omega,$$
and at infinite distances within the space (if such there are)
$$\tau = \omega,$$
then throughout the whole space
$$\tau = \omega.$$

This will be apparent if we consider separately each of the scalar
components of $\tau$ and $\omega$.


Minimum Values of the Volume-integral $\iiint u\,\omega\cdot\omega\,dv$.
(Thomson's Theorems.)


90. Let it be required to determine for a certain space a vector
function of position $\omega$ subject to certain conditions (to be specified
hereafter), so that the volume-integral

$$\iiint u\,\omega\cdot\omega\,dv$$

for that space shall have a minimum value, $u$ denoting a given
positive scalar function of position.

a. In the first place, let the vector $\omega$ be subject to the conditions
that $\nabla\cdot\omega$ is given within the space, and that the normal component
of $\omega$ is given for the bounding surface. (This component must of
course be such that the surface-integral of $\omega$ shall be equal to the
volume-integral $\iiint\nabla\cdot\omega\,dv$. If the space is not continuous, this must
be true of each continuous portion of it. See No. 57.) The solution
is that $\nabla\times(u\omega) = 0$, or more generally, that the line-integral of $u\omega$ for
any closed curve in the space shall vanish.


The existence of the minimum requires that

$$\iiint u\,\omega\cdot\delta\omega\,dv = 0,$$

while $\delta\omega$ is subject to the limitation that
$$\nabla\cdot\delta\omega = 0,$$
and that the normal component of $\delta\omega$ at the bounding surface vanishes.
To prove that the line-integral of $u\omega$ vanishes for any closed curve
within the space, let us imagine the curve to be surrounded by an
infinitely slender tube of normal section $dz$, which may be either
constant or variable. We may satisfy the equation $\nabla\cdot\delta\omega = 0$ by
making $\delta\omega = 0$ outside of the tube, and $\delta\omega\,dz = \delta a\,\dfrac{d\rho}{ds}$ within it, $\delta a$
denoting an arbitrary infinitesimal constant, $\rho$ the position-vector, and
$ds$ an element of the length of the tube or closed curve. We have then

$$\iiint u\,\omega\cdot\delta\omega\,dv = \int u\,\omega\cdot\delta\omega\,dz\,ds = \int u\,\omega\cdot d\rho\,\delta a = \delta a\int u\,\omega\cdot d\rho = 0,$$
whence
$$\int u\,\omega\cdot d\rho = 0.\qquad\text{Q.E.D.}$$

We may express this result by saying that $u\omega$ is the derivative of a
single-valued scalar function of position in space. (See No. 67.)

If for certain parts of the surface the normal component of $\omega$ is not
given for each point, but only the surface-integral of $\omega$ for each such
part, then the above reasoning will apply not only to closed curves,
but also to curves commencing and ending in such a part of the
surface. The primitive of $u\omega$ will then have a constant value in
each such part.

If the space extends to infinity and there is no special condition
respecting the value of $\omega$ at infinite distances, the primitive of $u\omega$
will have a constant value at infinite distances within the space or
within each separate continuous part of it.

If we except those cases in which the problem has no definite
meaning because the data are such that the integral $\iiint u\,\omega\cdot\omega\,dv$ must
be infinite, it is evident that a minimum must always exist, and (on
account of the quadratic form of the integral) that it is unique. That
the conditions just found are sufficient to insure this minimum, is
evident from the consideration that any allowable values of $\delta\omega$ may
be made up of such values as we have supposed. Therefore, there
will be one and only one vector function of position in space which
satisfies these conditions together with those enumerated at the
beginning of this number.

b. In the second place, let the vector $\omega$ be subject to the conditions
that $\nabla\times\omega$ is given throughout the space, and that the tangential
component of $\omega$ is given at the bounding surface. The solution is that

$$\nabla\cdot[u\omega] = 0,$$

and, if the space is periphractic, that the surface-integral of $u\omega$
vanishes for each of the bounding surfaces.

The existence of the minimum requires that

$$\iiint u\,\omega\cdot\delta\omega\,dv = 0,$$

while $\delta\omega$ is subject to the conditions that

$$\nabla\times\delta\omega = 0,$$

and that the tangential component of $\delta\omega$ in the bounding surface
vanishes. In virtue of these conditions we may set

$$\delta\omega = \nabla\delta q,$$

where $\delta q$ is an arbitrary infinitesimal scalar function of position,
subject only to the condition that it is constant in each of the bounding
surfaces. (See No. 67.) By substitution of this value we obtain

$$\iiint u\,\omega\cdot\nabla\delta q\,dv = 0,$$

or integrating by parts (No. 76)

$$\iint u\,\omega\cdot d\sigma\,\delta q - \iiint \nabla\cdot[u\omega]\,\delta q\,dv = 0.$$

Since $\delta q$ is arbitrary in the volume-integral, we have throughout the
whole space

$$\nabla\cdot[u\omega] = 0;$$

and since $\delta q$ has an arbitrary constant value in each of the bounding
surfaces (if the boundary of the space consists of separate parts), we
have for each such part

$$\iint u\,\omega\cdot d\sigma = 0.$$


Potentials, Newtonians, Laplacians. 


91. Def.—If $u'$ is the scalar quantity of something situated at a
certain point $\rho'$, the potential of $u'$ for any point $\rho$ is a scalar function
of $\rho$, defined by the equation

$$\operatorname{pot} u' = \frac{u'}{[\rho'-\rho]_0},$$

and the Newtonian of $u'$ for any point $\rho$ is a vector function of $\rho$
defined by the equation

$$\operatorname{new} u' = \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\,u'.$$

Again, if $\omega'$ is the vector representing the quantity and direction of
something situated at the point $\rho'$, the potential and the Laplacian of
$\omega'$ for any point $\rho$ are vector functions of $\rho$ defined by the equations

$$\operatorname{pot} \omega' = \frac{\omega'}{[\rho'-\rho]_0},$$

$$\operatorname{lap} \omega' = \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\times\omega'.$$
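The four definitions of No. 91 can be tried numerically for a single source point. The following Python sketch is an illustration only: the function names `pot`, `new`, and `lap`, the helper functions, and the representation of vectors as 3-tuples are assumptions made here and do not appear in the text.

```python
import math

def sub(a, b):
    """Component-wise difference a - b of two 3-vectors."""
    return tuple(x - y for x, y in zip(a, b))

def norm(a):
    """Magnitude of a 3-vector (written [a]_0 in the text)."""
    return math.sqrt(sum(x * x for x in a))

def cross(a, b):
    """Skew product a x b."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def pot(q, rho_p, rho):
    """Potential at rho of a scalar or vector quantity q situated at rho_p."""
    r = norm(sub(rho_p, rho))
    if isinstance(q, tuple):
        return tuple(c / r for c in q)
    return q / r

def new(u, rho_p, rho):
    """Newtonian at rho of the scalar quantity u situated at rho_p."""
    d = sub(rho_p, rho)
    r = norm(d)
    return tuple(c * u / r ** 3 for c in d)

def lap(w, rho_p, rho):
    """Laplacian at rho of the vector quantity w situated at rho_p."""
    d = sub(rho_p, rho)
    r = norm(d)
    return tuple(c / r ** 3 for c in cross(d, w))

source = (0.0, 0.0, 2.0)   # hypothetical position rho'
origin = (0.0, 0.0, 0.0)   # point rho at which the values are taken
print(pot(1.0, source, origin))              # 0.5
print(new(1.0, source, origin))              # (0.0, 0.0, 0.25)
print(lap((1.0, 0.0, 0.0), source, origin))  # (0.0, 0.25, 0.0)
```

The Newtonian points from $\rho$ toward the source $\rho'$, and the Laplacian is perpendicular both to that direction and to $\omega'$.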




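92. If $u$ or $\omega$ is a scalar or vector function of position in space,
we may write $\operatorname{Pot} u$, $\operatorname{New} u$, $\operatorname{Pot}\omega$, $\operatorname{Lap}\omega$ for the volume-integrals of
$\operatorname{pot} u'$, etc., taken as functions of $\rho'$; i.e., we may set

$$\operatorname{Pot} u = \iiint \operatorname{pot} u'\,dv' = \iiint \frac{u'}{[\rho'-\rho]_0}\,dv',$$

$$\operatorname{New} u = \iiint \operatorname{new} u'\,dv' = \iiint \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\,u'\,dv',$$

$$\operatorname{Pot} \omega = \iiint \operatorname{pot} \omega'\,dv' = \iiint \frac{\omega'}{[\rho'-\rho]_0}\,dv',$$

$$\operatorname{Lap} \omega = \iiint \operatorname{lap} \omega'\,dv' = \iiint \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\times\omega'\,dv',$$

where the $\rho$ is to be regarded as constant in the integration. This
extends over all space, or wherever the $u'$ or $\omega'$ have any values
other than zero. These integrals may themselves be called (integral)
potentials, Newtonians, and Laplacians.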


93.
$$\frac{d\operatorname{Pot} u}{dx} = \operatorname{Pot}\frac{du}{dx}, \qquad \frac{d\operatorname{Pot} \omega}{dx} = \operatorname{Pot}\frac{d\omega}{dx}.$$

This will be evident with respect both to scalar and to vector functions,
if we suppose that when we differentiate the potential with respect
to $x$ (thus varying the position of the point for which the potential
is taken) each element of volume $dv'$ in the implied integral remains
fixed, not in absolute position, but in position relative to the point
for which the potential is taken. This supposition is evidently allowable
whenever the integration indicated by the symbol Pot tends to a
definite limit when the limits of integration are indefinitely extended.

Since we may substitute $y$ and $z$ for $x$ in the preceding formula,
and since a constant factor of any kind may be introduced under the
sign of integration, we have

$$\nabla\operatorname{Pot} u = \operatorname{Pot}\nabla u,$$
$$\nabla\cdot\operatorname{Pot} \omega = \operatorname{Pot}\nabla\cdot\omega,$$
$$\nabla\times\operatorname{Pot} \omega = \operatorname{Pot}\nabla\times\omega,$$
$$\nabla\cdot\nabla\operatorname{Pot} u = \operatorname{Pot}\nabla\cdot\nabla u,$$
$$\nabla\cdot\nabla\operatorname{Pot} \omega = \operatorname{Pot}\nabla\cdot\nabla\omega,$$

i.e., the symbols $\nabla$, $\nabla\cdot$, $\nabla\times$, $\nabla\cdot\nabla$ may be applied indifferently before
or after the sign Pot.

Yet a certain restriction is to be observed. When the operation of
taking the (integral) potential does not give a definite finite value,
the first members of these equations are to be regarded as entirely
indeterminate, but the second members may have perfectly definite
values. This would be the case, for example, if $u$ or $\omega$ had a constant
value throughout all space. It might seem harmless to set an indefinite
expression equal to a definite, but it would be dangerous, since
we might with equal right set the indefinite expression equal to other
definite expressions, and then be misled into supposing these definite
expressions to be equal to one another. It will be safe to say that
the above equations will hold, provided that the potential of $u$ or $\omega$
has a definite value. It will be observed that whenever $\operatorname{Pot} u$ or
$\operatorname{Pot}\omega$ has a definite value in general (i.e., with the possible exception
of certain points, lines, and surfaces),* the first members of all these
equations will have definite values in general, and therefore the
second members of the equation, being necessarily equal to the first
members, when these have definite values, will also have definite
values in general.
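94. Again, whenever $\operatorname{Pot} u$ has a definite value we may write

$$\nabla\operatorname{Pot} u = \nabla\iiint \frac{u'}{r}\,dv' = \iiint \nabla\frac{1}{r}\,u'\,dv',$$

where $r$ stands for $[\rho'-\rho]_0$. But

$$\nabla\frac{1}{r} = \frac{\rho'-\rho}{r^3},$$

whence $\nabla\operatorname{Pot} u = \operatorname{New} u$.

Moreover, $\operatorname{New} u$ will in general have a definite value, if $\operatorname{Pot} u$ has.

95. In like manner, whenever $\operatorname{Pot}\omega$ has a definite value,

$$\nabla\times\operatorname{Pot}\omega = \nabla\times\iiint \frac{\omega'}{r}\,dv' = \iiint \nabla\times\frac{\omega'}{r}\,dv' = \iiint \nabla\frac{1}{r}\times\omega'\,dv'.$$

Substituting the value of $\nabla\dfrac{1}{r}$ given above we have

$$\nabla\times\operatorname{Pot}\omega = \operatorname{Lap}\omega.$$

$\operatorname{Lap}\omega$ will have a definite value in general whenever $\operatorname{Pot}\omega$ has.

96. Hence, with the aid of No. 93, we obtain

$$\nabla\times\operatorname{Lap}\omega = \operatorname{Lap}\nabla\times\omega,$$
$$\nabla\cdot\operatorname{Lap}\omega = 0,$$

whenever $\operatorname{Pot}\omega$ has a definite value.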
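The relations $\nabla\operatorname{Pot} u=\operatorname{New} u$ and $\nabla\times\operatorname{Pot}\omega=\operatorname{Lap}\omega$ can be checked for a single source point, since the integrals are sums of such elements. The Python sketch below is an illustration under stated assumptions: the source position `SOURCE`, the vector `W`, the step size, and all function names are hypothetical choices, and the derivatives are approximated by central differences taken with respect to $\rho$.

```python
import math

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def norm(a):
    return math.sqrt(sum(x * x for x in a))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

SOURCE = (0.3, -0.2, 1.5)   # hypothetical source point rho'
W = (1.0, 2.0, -1.0)        # hypothetical vector quantity situated there

def pot_scalar(rho):
    """pot u' for a unit scalar quantity at SOURCE."""
    return 1.0 / norm(sub(SOURCE, rho))

def pot_vector(rho):
    """pot w' for the vector quantity W at SOURCE."""
    r = norm(sub(SOURCE, rho))
    return tuple(c / r for c in W)

def new_unit(rho):
    """new u' for a unit scalar quantity at SOURCE."""
    d = sub(SOURCE, rho)
    r = norm(d)
    return tuple(c / r ** 3 for c in d)

def lap_w(rho):
    """lap w' for the vector quantity W at SOURCE."""
    d = sub(SOURCE, rho)
    r = norm(d)
    return tuple(c / r ** 3 for c in cross(d, W))

def shifted(rho, axis, h):
    p = list(rho)
    p[axis] += h
    return tuple(p)

def grad(f, rho, h=1e-5):
    """Central-difference gradient of a scalar function of rho."""
    return tuple((f(shifted(rho, k, h)) - f(shifted(rho, k, -h))) / (2 * h)
                 for k in range(3))

def curl(F, rho, h=1e-5):
    """Central-difference curl; d[k][i] approximates dF_i/dx_k."""
    d = [[(F(shifted(rho, k, h))[i] - F(shifted(rho, k, -h))[i]) / (2 * h)
          for i in range(3)] for k in range(3)]
    return (d[1][2] - d[2][1], d[2][0] - d[0][2], d[0][1] - d[1][0])

def close(a, b, tol=1e-6):
    return all(abs(x - y) < tol for x, y in zip(a, b))

rho = (1.0, 0.5, -0.4)
print(close(grad(pot_scalar, rho), new_unit(rho)))  # True
print(close(curl(pot_vector, rho), lap_w(rho)))     # True
```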
* Whenever it is said that a function of position in space has a definite value in
general, this phrase is to be understood as explained above. The term definite is
intended to exclude both indeterminate and infinite values.

97. By the method of No. 93 we obtain

$$\nabla\cdot\operatorname{New} u = \nabla\cdot\iiint \frac{\rho'-\rho}{r^3}\,u'\,dv' = \iiint \frac{\rho'-\rho}{r^3}\cdot\nabla' u'\,dv'.$$

To find the value of this integral, we may regard the point $\rho$, which
is constant in the integration, as the center of polar coordinates.
Then $r$ becomes the radius vector of the point $\rho'$, and we may set

$$dv' = r^2\,dq\,dr,$$

where $r^2\,dq$ is the element of a spherical surface having center at $\rho$
and radius $r$. We may also set

$$\frac{\rho'-\rho}{r}\cdot\nabla' u' = \frac{du'}{dr}.$$

We thus obtain

$$\nabla\cdot\operatorname{New} u = \iint \frac{du'}{dr}\,dq\,dr = 4\pi\int \frac{d\bar{u}'}{dr}\,dr = 4\pi\,\bar{u}'_{r=\infty} - 4\pi\,\bar{u}'_{r=0},$$

where $\bar{u}'$ denotes the average value of $u'$ in a spherical surface of
radius $r$ about the point $\rho$ as center.

Now if $\operatorname{Pot} u$ has in general a definite value, we must have $\bar{u}'=0$
for $r=\infty$. Also, $\nabla\cdot\operatorname{New} u$ will have in general a definite value. For
$r=0$, the value of $\bar{u}'$ is evidently $u$. We have, therefore,

$$\nabla\cdot\operatorname{New} u = -4\pi u,$$

$$\nabla\cdot\nabla\operatorname{Pot} u = -4\pi u.*$$
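The result $\nabla\cdot\nabla\operatorname{Pot} u=-4\pi u$ can be illustrated on a case where the potential is known in closed form. The Python sketch below assumes a uniform ball of density `U0` and radius `R` (both hypothetical values) and uses the standard closed-form potential of such a ball, which is not derived in the text; the Laplacian is approximated by second central differences.

```python
import math

U0 = 1.5   # hypothetical uniform value of u inside the ball
R = 2.0    # hypothetical radius of the ball

def pot_ball(rho):
    """Pot u for u = U0 inside a ball of radius R about the origin, 0 outside.

    Standard closed form (an assumption here, not taken from the text):
    inside, 2*pi*U0*(R^2 - r^2/3); outside, (4*pi/3)*U0*R^3/r.
    """
    r = math.sqrt(sum(x * x for x in rho))
    if r <= R:
        return 2.0 * math.pi * U0 * (R * R - r * r / 3.0)
    return 4.0 * math.pi * U0 * R ** 3 / (3.0 * r)

def laplacian(f, rho, h=1e-3):
    """Second central difference approximation of div grad f at rho."""
    total = 0.0
    for k in range(3):
        plus = list(rho)
        minus = list(rho)
        plus[k] += h
        minus[k] -= h
        total += (f(plus) - 2.0 * f(rho) + f(minus)) / (h * h)
    return total

inside = (0.5, 0.3, -0.2)
outside = (3.0, 1.0, 0.0)
print(abs(laplacian(pot_ball, inside) + 4.0 * math.pi * U0) < 1e-5)  # True
print(abs(laplacian(pot_ball, outside)) < 1e-4)                      # True
```

Inside the ball the Laplacian of the potential is $-4\pi u$; outside, where $u=0$, it vanishes.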


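98. If $\operatorname{Pot}\omega$ has in general a definite value,

$$\nabla\cdot\nabla\operatorname{Pot}\omega = \nabla\cdot\nabla\operatorname{Pot}[ui+vj+wk]$$
$$= \nabla\cdot\nabla\operatorname{Pot} u\;i + \nabla\cdot\nabla\operatorname{Pot} v\;j + \nabla\cdot\nabla\operatorname{Pot} w\;k$$
$$= -4\pi u\,i - 4\pi v\,j - 4\pi w\,k$$
$$= -4\pi\omega.$$

Hence, by No. 71,

$$\nabla\times\nabla\times\operatorname{Pot}\omega - \nabla\nabla\cdot\operatorname{Pot}\omega = 4\pi\omega.$$

That is, $\operatorname{Lap}\nabla\times\omega - \operatorname{New}\nabla\cdot\omega = 4\pi\omega.$

If we set

$$\omega_1 = \frac{1}{4\pi}\operatorname{Lap}\nabla\times\omega, \qquad \omega_2 = -\frac{1}{4\pi}\operatorname{New}\nabla\cdot\omega,$$

we have $\omega = \omega_1+\omega_2$,
where $\omega_1$ and $\omega_2$ are such functions of position that $\nabla\cdot\omega_1=0$, and
$\nabla\times\omega_2=0$. This is expressed by saying that $\omega_1$ is solenoidal, and $\omega_2$
irrotational. $\operatorname{Pot}\omega_1$ and $\operatorname{Pot}\omega_2$, like $\operatorname{Pot}\omega$, will have in general
definite values.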

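It is worth while to notice that there is only one way in which a
vector function of position in space having a definite potential can
be thus divided into solenoidal and irrotational parts having definite
potentials. For if $\omega_1+\epsilon$, $\omega_2-\epsilon$ are two other such parts,

$$\nabla\cdot\epsilon = 0 \quad\text{and}\quad \nabla\times\epsilon = 0.$$

Moreover, $\operatorname{Pot}\epsilon$ has in general a definite value, and therefore

$$\epsilon = \frac{1}{4\pi}\operatorname{Lap}\nabla\times\epsilon - \frac{1}{4\pi}\operatorname{New}\nabla\cdot\epsilon = 0. \qquad \text{Q.E.D.}$$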


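* Better thus:
$$\nabla\cdot\nabla\operatorname{Pot} u = \iiint \frac{1}{r}\,\nabla\cdot\nabla u\,dv = \iiint \nabla\cdot\Big(\frac{1}{r}\nabla u\Big)dv - \iiint \nabla\cdot\Big(u\,\nabla\frac{1}{r}\Big)dv + \iiint u\,\nabla\cdot\nabla\frac{1}{r}\,dv$$
$$= -\iint u\,\nabla\frac{1}{r}\cdot d\sigma = -4\pi u. \qquad \text{[MS. note by author.]}$$

99. To assist the memory of the student, some of the principal
results of Nos. 93-98 may be expressed as follows: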

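Let $\omega_1$ be any solenoidal vector function of position in space, $\omega_2$ any
irrotational vector function, and $u$ any scalar function, satisfying the
conditions that their potentials have in general definite values.

With respect to the solenoidal function $\omega_1$, $\dfrac{1}{4\pi}\operatorname{Lap}$ and $\nabla\times$ are
inverse operators; i.e.,

$$\frac{1}{4\pi}\operatorname{Lap}\nabla\times\omega_1 = \nabla\times\frac{1}{4\pi}\operatorname{Lap}\omega_1 = \omega_1.$$

Applied to the irrotational function $\omega_2$, either of these operators gives
zero; i.e.,

$$\operatorname{Lap}\omega_2 = 0, \qquad \nabla\times\omega_2 = 0.$$

With respect to the irrotational function $\omega_2$, or the scalar function $u$,
$\dfrac{1}{4\pi}\operatorname{New}$ and $-\nabla\cdot$ are inverse operators; i.e.,

$$-\frac{1}{4\pi}\operatorname{New}\nabla\cdot\omega_2 = \omega_2, \qquad -\nabla\cdot\frac{1}{4\pi}\operatorname{New} u = u.$$

Applied to the solenoidal function $\omega_1$, the operator $\nabla\cdot$ gives zero; i.e.,

$$\nabla\cdot\omega_1 = 0.$$

Since the most general form of a vector function having in general
a definite potential may be written $\omega_1+\omega_2$, the effect of these operators
on such a function needs no especial mention.

With respect to the solenoidal function $\omega_1$, $\dfrac{1}{4\pi}\operatorname{Pot}$ and $\nabla\times\nabla\times$ are
inverse operators; i.e.,

$$\frac{1}{4\pi}\operatorname{Pot}\nabla\times\nabla\times\omega_1 = \nabla\times\frac{1}{4\pi}\operatorname{Pot}\nabla\times\omega_1 = \nabla\times\nabla\times\frac{1}{4\pi}\operatorname{Pot}\omega_1 = \omega_1.$$

With respect to the irrotational function $\omega_2$, $\dfrac{1}{4\pi}\operatorname{Pot}$ and $-\nabla\nabla\cdot$ are
inverse operators; i.e.,

$$-\frac{1}{4\pi}\operatorname{Pot}\nabla\nabla\cdot\omega_2 = -\nabla\frac{1}{4\pi}\operatorname{Pot}\nabla\cdot\omega_2 = -\nabla\nabla\cdot\frac{1}{4\pi}\operatorname{Pot}\omega_2 = \omega_2.$$

With respect to any scalar or vector function having in general a
definite potential, $\dfrac{1}{4\pi}\operatorname{Pot}$ and $-\nabla\cdot\nabla$ are inverse operators; i.e.,

$$-\frac{1}{4\pi}\operatorname{Pot}\nabla\cdot\nabla u = -\nabla\cdot\frac{1}{4\pi}\operatorname{Pot}\nabla u = -\nabla\cdot\nabla\frac{1}{4\pi}\operatorname{Pot} u = u,$$

$$-\frac{1}{4\pi}\operatorname{Pot}\nabla\cdot\nabla[\omega_1+\omega_2] = -\nabla\cdot\nabla\frac{1}{4\pi}\operatorname{Pot}[\omega_1+\omega_2] = \omega_1+\omega_2.$$

With respect to the solenoidal function $\omega_1$, $-\nabla\cdot\nabla$ and $\nabla\times\nabla\times$ are
equivalent; with respect to the irrotational function $\omega_2$, $\nabla\cdot\nabla$ and $\nabla\nabla\cdot$
are equivalent; i.e.,

$$-\nabla\cdot\nabla\omega_1 = \nabla\times\nabla\times\omega_1, \qquad \nabla\cdot\nabla\omega_2 = \nabla\nabla\cdot\omega_2.$$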


100. On the interpretation of the preceding formulæ.—Infinite
values of the quantity which occurs in a volume-integral as the 
coefficient of the element of volume will not necessarily make the 
value of the integral infinite, when they are confined to certain 
surfaces, lines, or points. Yet these surfaces, lines, or points may 
contribute a certain finite amount to the value of the volume-integral, 
which must be separately calculated, and in the case of surfaces or 
lines is naturally expressed as a surface- or line-integral. Such cases 
are easily treated by substituting for the surface, line, or point, a 
very thin shell, or filament, or a solid very small in all dimensions, 
within which the function may be supposed to have a very large 
value. 

The only cases which we shall here consider in detail are those of
surfaces at which the functions of position ($u$ or $\omega$) are discontinuous,
and the values of $\nabla u$, $\nabla\times\omega$, $\nabla\cdot\omega$ thus become infinite. Let the
function $u$ have the value $u_1$ on the side of the surface which we
regard as the negative, and the value $u_2$ on the positive side. Let
$\Delta u=u_2-u_1$. If we substitute for the surface a shell of very small
thickness $a$, within which the value of $u$ varies uniformly as we pass
through the shell, we shall have $\nabla u=\nu\,\dfrac{\Delta u}{a}$ within the shell, $\nu$ denoting
a unit normal on the positive side of the surface. The elements of
volume which compose the shell may be expressed by $a[d\sigma]_0$, where
$[d\sigma]_0$ is the magnitude of an element of the surface, $d\sigma$ being the
vector element. Hence,

$$\nabla u\,dv = \nu\,\Delta u\,[d\sigma]_0 = \Delta u\,d\sigma.$$

Hence, when there are surfaces at which the values of $u$ are
discontinuous, the full value of $\operatorname{Pot}\nabla u$ should always be understood
as including the surface-integral

$$\iint \frac{\Delta u'\,d\sigma'}{[\rho'-\rho]_0}$$

relating to such surfaces. ($\Delta u'$ and $d\sigma'$ are accented in the formula
to indicate that they relate to the point $\rho'$.)

In the case of a vector function which is discontinuous at a surface,
the expressions $\nabla\cdot\omega\,dv$ and $\nabla\times\omega\,dv$, relating to the element of the
shell which we substitute for the surface of discontinuity, are easily
transformed by the principle that these expressions are the direct
and skew surface-integrals of $\omega$ for the element of the shell. (See
Nos. 55, 56.) The part of the surface-integrals relating to the edge
of the element may evidently be neglected, and we shall have

$$\nabla\cdot\omega\,dv = \omega_2\cdot d\sigma - \omega_1\cdot d\sigma = \Delta\omega\cdot d\sigma,$$
$$\nabla\times\omega\,dv = d\sigma\times\omega_2 - d\sigma\times\omega_1 = d\sigma\times\Delta\omega.$$

Whenever, therefore, $\omega$ is discontinuous at surfaces, the expressions
$\operatorname{Pot}\nabla\cdot\omega$ and $\operatorname{New}\nabla\cdot\omega$ must be regarded as implicitly including the
surface-integrals

$$\iint \frac{1}{[\rho'-\rho]_0}\,\Delta\omega'\cdot d\sigma' \quad\text{and}\quad \iint \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\,\Delta\omega'\cdot d\sigma'$$

respectively, relating to such surfaces, and the expressions $\operatorname{Pot}\nabla\times\omega$
and $\operatorname{Lap}\nabla\times\omega$ as including the surface-integrals

$$\iint \frac{1}{[\rho'-\rho]_0}\,d\sigma'\times\Delta\omega' \quad\text{and}\quad \iint \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\times[d\sigma'\times\Delta\omega']$$

respectively, relating to such surfaces.

101. We have already seen that if $\omega$ is the curl of any vector
function of position, $\nabla\cdot\omega=0$. (No. 68.) The converse is evidently
true, whenever the equation $\nabla\cdot\omega=0$ holds throughout all space, and
$\omega$ has in general a definite potential; for then

$$\omega = \nabla\times\frac{1}{4\pi}\operatorname{Lap}\omega.$$

Again, if $\nabla\cdot\omega=0$ within any aperiphractic space A, contained
within finite boundaries, we may suppose that space to be enclosed by
a shell B having its inner surface coincident with the surface of A.
We may imagine a function of position $\omega'$, such that $\omega'=\omega$ in A, $\omega'=0$
outside of the shell B, and the integral $\iiint\omega'\cdot\omega'\,dv$ for B has the least
value consistent with the conditions that the normal component of $\omega'$
at the outer surface is zero, and at the inner surface is equal to that
of $\omega$, and that in the shell $\nabla\cdot\omega'=0$ (compare No. 90). Then $\nabla\cdot\omega'=0$
throughout all space, and the potential of $\omega'$ will have in general a
definite value. Hence,

$$\omega' = \nabla\times\frac{1}{4\pi}\operatorname{Lap}\omega',$$

and $\omega$ will have the same value within the space A.
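†102. Def.—If $\omega$ is a vector function of position in space, the
Maxwellian* of $\omega$ is a scalar function of position defined by the equation

$$\operatorname{Max}\omega = \iiint \frac{\rho'-\rho}{[\rho'-\rho]_0^3}\cdot\omega'\,dv'.$$

(Compare No. 92.) From this definition the following properties are
easily derived. It is supposed that the functions $\omega$ and $u$ are such
that their potentials have in general definite values.

$$\operatorname{Max}\omega = \nabla\cdot\operatorname{Pot}\omega = \operatorname{Pot}\nabla\cdot\omega,$$
$$\nabla\operatorname{Max}\omega = \nabla\nabla\cdot\operatorname{Pot}\omega = \operatorname{New}\nabla\cdot\omega,$$
$$\operatorname{Max}\nabla u = -4\pi u,$$
$$4\pi\omega = \nabla\times\operatorname{Lap}\omega - \nabla\operatorname{Max}\omega.$$

If the values of $\operatorname{Lap}\operatorname{Lap}\omega$, $\operatorname{New}\operatorname{Max}\omega$, and $\operatorname{Max}\operatorname{New} u$ are in general
definite, we may add

$$4\pi\operatorname{Pot}\omega = \operatorname{Lap}\operatorname{Lap}\omega - \operatorname{New}\operatorname{Max}\omega,$$
$$4\pi\operatorname{Pot} u = -\operatorname{Max}\operatorname{New} u.$$

In other words: The Maxwellian is the divergence of the potential,
$-\dfrac{1}{4\pi}\operatorname{Max}$ and $\nabla$ are inverse operators for scalars and irrotational vectors,
for vectors in general $-\dfrac{1}{4\pi}\nabla\operatorname{Max}$ is an operator which separates the
irrotational from the solenoidal part. For scalars and irrotational
vectors, $-\dfrac{1}{4\pi}\operatorname{Max}\operatorname{New}$ and $-\dfrac{1}{4\pi}\operatorname{New}\operatorname{Max}$ give the potential, for solenoidal
vectors $\dfrac{1}{4\pi}\operatorname{Lap}\operatorname{Lap}$ gives the potential, for vectors in general
$-\dfrac{1}{4\pi}\operatorname{New}\operatorname{Max}$ gives the potential of the irrotational part, and $\dfrac{1}{4\pi}\operatorname{Lap}\operatorname{Lap}$
the potential of the solenoidal part.

* The frequent occurrence of the integral in Maxwell's Treatise on Electricity and
Magnetism has suggested this name.
† [The foregoing portion of this paper was printed in 1881, the rest in 1884.]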

103. Def—The following double volume-integrals are of frequent 
occurrence in physical problems. They are all scalar quantities, and 
none of them functions of position in space, as are the single volume- 
integrals which we have been considering. The integrations extend 
over all space, or as far as the expression to be integrated has values 
other than zero. 

The mutual potential, or potential product, of two scalar functions
of position in space is defined by the equation

$$\operatorname{Pot}(u, w) = \iiint\!\!\iiint \frac{u\,w'}{r}\,dv\,dv' = \iiint u\operatorname{Pot} w\,dv = \iiint w\operatorname{Pot} u\,dv.$$

In the double volume-integral, $r$ is the distance between the two
elements of volume, and $u$ relates to $dv$ as $w'$ to $dv'$.

The mutual potential, or potential product, of two vector functions
of position in space is defined by the equation

$$\operatorname{Pot}(\phi, \omega) = \iiint\!\!\iiint \frac{\phi\cdot\omega'}{r}\,dv\,dv' = \iiint \phi\cdot\operatorname{Pot}\omega\,dv = \iiint \omega\cdot\operatorname{Pot}\phi\,dv.$$

The mutual Laplacian, or Laplacian product, of two vector
functions of position in space is defined by the equation

$$\operatorname{Lap}(\phi, \omega) = \iiint\!\!\iiint \omega\cdot\frac{\rho'-\rho}{r^3}\times\phi'\,dv\,dv' = \iiint \omega\cdot\operatorname{Lap}\phi\,dv = \iiint \phi\cdot\operatorname{Lap}\omega\,dv.$$

The Newtonian product of a scalar and a vector function of position
in space is defined by the equation

$$\operatorname{New}(u, \omega) = \iiint\!\!\iiint \omega\cdot\frac{\rho'-\rho}{r^3}\,u'\,dv\,dv' = \iiint \omega\cdot\operatorname{New} u\,dv.$$

The Maxwellian product of a vector and a scalar function of
position in space is defined by the equation

$$\operatorname{Max}(\omega, u) = \iiint\!\!\iiint u\,\frac{\rho'-\rho}{r^3}\cdot\omega'\,dv\,dv' = \iiint u\operatorname{Max}\omega\,dv = -\operatorname{New}(u, \omega).$$

It is of course supposed that $u$, $w$, $\phi$, $\omega$ are such functions of position
that the above expressions have definite values.

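104. By No. 97,

$$4\pi u\operatorname{Pot} w = -\nabla\cdot\operatorname{New} u\,\operatorname{Pot} w = -\nabla\cdot[\operatorname{New} u\operatorname{Pot} w] + \operatorname{New} u\cdot\operatorname{New} w.$$

The volume-integral of this equation gives

$$4\pi\operatorname{Pot}(u, w) = \iiint \operatorname{New} u\cdot\operatorname{New} w\,dv,$$

if the integral

$$\iint d\sigma\cdot\operatorname{New} u\,\operatorname{Pot} w,$$

for a closed surface, vanishes when the space included by the surface
is indefinitely extended in all directions. This will be the case when
everywhere outside of certain assignable limits the values of $u$ and $w$
are zero.

Again, by No. 102,

$$4\pi\,\omega\cdot\operatorname{Pot}\phi = \nabla\times\operatorname{Lap}\omega\cdot\operatorname{Pot}\phi - \nabla\operatorname{Max}\omega\cdot\operatorname{Pot}\phi$$
$$= \nabla\cdot[\operatorname{Lap}\omega\times\operatorname{Pot}\phi] + \operatorname{Lap}\omega\cdot\operatorname{Lap}\phi - \nabla\cdot[\operatorname{Max}\omega\operatorname{Pot}\phi] + \operatorname{Max}\omega\operatorname{Max}\phi.$$

The volume-integral of this equation gives

$$4\pi\operatorname{Pot}(\phi, \omega) = \iiint \operatorname{Lap}\phi\cdot\operatorname{Lap}\omega\,dv + \iiint \operatorname{Max}\phi\operatorname{Max}\omega\,dv,$$

if the integrals

$$\iint d\sigma\cdot\operatorname{Lap}\omega\times\operatorname{Pot}\phi, \qquad \iint d\sigma\cdot\operatorname{Pot}\phi\,\operatorname{Max}\omega,$$

for a closed surface vanish when the space included by the surface is
indefinitely extended in all directions. This will be the case if everywhere
outside of certain assignable limits the values of $\phi$ and $\omega$ are
zero.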
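The first result of No. 104 can be checked on the uniform ball, for which both sides reduce to radial integrals. The Python sketch below is an illustration under stated assumptions: the values `U0` and `R`, the midpoint rule, and the closed forms of $\operatorname{Pot} u$ and of the magnitude of $\operatorname{New} u$ for a uniform ball are standard results brought in here and are not taken from the text. The outer integral, which extends to infinity, is transformed by the substitution $s=1/r$.

```python
import math

U0 = 1.5   # hypothetical uniform value of u inside the ball
R = 2.0    # hypothetical radius of the ball
C = 4.0 * math.pi / 3.0

def midpoint(f, a, b, n=20000):
    """Midpoint-rule approximation of the integral of f from a to b."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# Pot(u, u) = integral of u Pot u dv, with Pot u = 2*pi*U0*(R^2 - r^2/3) inside.
pot_uu = midpoint(
    lambda r: U0 * 2.0 * math.pi * U0 * (R * R - r * r / 3.0) * 4.0 * math.pi * r * r,
    0.0, R)

# Inside the ball the magnitude of New u is C*U0*r.
inner = midpoint(lambda r: (C * U0 * r) ** 2 * 4.0 * math.pi * r * r, 0.0, R)

# Outside it is C*U0*R^3/r^2; with s = 1/r the volume element 4*pi*r^2 dr
# becomes 4*pi*s^(-4) ds and the range r in (R, infinity) becomes s in (0, 1/R).
outer = midpoint(
    lambda s: (C * U0 * R ** 3 * s * s) ** 2 * 4.0 * math.pi / s ** 4,
    0.0, 1.0 / R)

lhs = 4.0 * math.pi * pot_uu
rhs = inner + outer
exact = 128.0 * math.pi ** 3 * U0 ** 2 * R ** 5 / 15.0
print(abs(lhs - rhs) / exact < 1e-6)    # True
print(abs(lhs - exact) / exact < 1e-6)  # True
```

Note that the integral of $\operatorname{New} u\cdot\operatorname{New} u$ must be taken over all space, not over the ball alone; the part outside the ball contributes five-sixths of the whole in this example.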


CHAPTER III. 
CONCERNING LINEAR VECTOR FUNCTIONS. 


105. Def—A vector function of a vector is said to be linear, when 
the function of the sum of any two vectors is equal to the sum of the 
functions of the vectors. That is, if 


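$$\operatorname{func.}[\rho+\rho'] = \operatorname{func.}[\rho] + \operatorname{func.}[\rho']$$

for all values of $\rho$ and $\rho'$, the function is linear. In such cases it is
easily shown that

$$\operatorname{func.}[a\rho+b\rho'+c\rho''+\text{etc.}] = a\operatorname{func.}[\rho] + b\operatorname{func.}[\rho'] + c\operatorname{func.}[\rho''] + \text{etc.}$$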


106. An expression of the form

$$\alpha\,\lambda\cdot\rho + \beta\,\mu\cdot\rho + \text{etc.}$$

evidently represents a linear function of $\rho$, and may be conveniently
written in the form

$$\{\alpha\lambda + \beta\mu + \text{etc.}\}\cdot\rho.$$

The expression

$$\rho\cdot\alpha\,\lambda + \rho\cdot\beta\,\mu + \text{etc.},$$

or

$$\rho\cdot\{\alpha\lambda + \beta\mu + \text{etc.}\},$$

also represents a linear function of $\rho$, which is, in general, different
from the preceding, and will be called its conjugate.
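The two ways of applying the expression $\{\alpha\lambda+\beta\mu+\text{etc.}\}$ to a vector can be sketched as follows. The representation of the expression as a Python list of (antecedent, consequent) pairs of 3-tuples, and the names `prefactor` and `postfactor`, are assumptions made for this illustration.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def scale(c, a):
    return tuple(c * x for x in a)

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def prefactor(pairs, rho):
    """{alpha lambda + beta mu + ...}.rho = alpha (lambda.rho) + beta (mu.rho) + ..."""
    out = (0.0, 0.0, 0.0)
    for first, second in pairs:
        out = add(out, scale(dot(second, rho), first))
    return out

def postfactor(rho, pairs):
    """rho.{alpha lambda + beta mu + ...} = (rho.alpha) lambda + (rho.beta) mu + ..."""
    out = (0.0, 0.0, 0.0)
    for first, second in pairs:
        out = add(out, scale(dot(rho, first), second))
    return out

i = (1.0, 0.0, 0.0)
j = (0.0, 1.0, 0.0)
expression = [(i, j)]          # the single term i j
rho = (2.0, 3.0, 5.0)
print(prefactor(expression, rho))   # (3.0, 0.0, 0.0)
print(postfactor(rho, expression))  # (0.0, 2.0, 0.0)
```

The two results differ, which illustrates why the second function is in general different from the first.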

107. Def.—An expression of the form $\alpha\lambda$ or $\beta\mu$ will be called a
dyad. An expression consisting of any number of dyads united by
the signs $+$ or $-$ will be called a dyadic binomial, trinomial, etc.,
as the case may be, or more briefly, a dyadic. The latter term will
be used so as to include the case of a single dyad. When we desire
to express a dyadic by a single letter, the Greek capitals will be used,
except such as are like the Roman, and also $\Delta$ and $\Sigma$. The letter I
will also be used to represent a certain dyadic, to be mentioned
hereafter.

Since any linear vector function may be expressed by means of a 
dyadic (as we shall see more particularly hereafter, see No. 110), the 
study of such functions, which is evidently of primary importance in 
the theory of vectors, may be reduced to that of dyadics. 

108. Def.—Any two dyadics $\Phi$ and $\Psi$ are equal,

when $\Phi\cdot\rho=\Psi\cdot\rho$ for all values of $\rho$,
or, when $\rho\cdot\Phi=\rho\cdot\Psi$ for all values of $\rho$,
or, when $\sigma\cdot\Phi\cdot\rho=\sigma\cdot\Psi\cdot\rho$ for all values of $\sigma$ and of $\rho$.

The third condition is easily shown to be equivalent both to the first
and to the second. The three conditions are therefore equivalent.

It follows that $\Phi=\Psi$, if $\Phi\cdot\rho=\Psi\cdot\rho$, or $\rho\cdot\Phi=\rho\cdot\Psi$, for three
non-complanar values of $\rho$.

109. Def.—We shall call the vector $\Phi\cdot\rho$ the (direct) product of
$\Phi$ and $\rho$, the vector $\rho\cdot\Phi$ the (direct) product of $\rho$ and $\Phi$, and the
scalar $\sigma\cdot\Phi\cdot\rho$ the (direct) product of $\sigma$, $\Phi$, and $\rho$.

In the combination $\Phi\cdot\rho$, we shall say that $\Phi$ is used as a prefactor,
in the combination $\rho\cdot\Phi$, as a postfactor.

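110. If $\tau$ is any linear function of $\rho$, and for $\rho=i$, $\rho=j$, $\rho=k$, the
values of $\tau$ are respectively $\alpha$, $\beta$, and $\gamma$, we may set

$$\tau = \{\alpha i+\beta j+\gamma k\}\cdot\rho,$$

and also

$$\tau = \rho\cdot\{i\alpha+j\beta+k\gamma\}.$$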

Therefore, any linear function may be expressed by a dyadic as 
prefactor and also by a dyadic as postfactor. 
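The construction of No. 110 can be tried on a concrete linear function. In the Python sketch below, the function `tau` (a skew product with a fixed vector) is a hypothetical example chosen only because it is linear; the list-of-pairs representation of a dyadic and the helper names are likewise assumptions of this illustration.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def scale(c, a):
    return tuple(c * x for x in a)

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def prefactor(dyadic, rho):
    """Dyadic given as (antecedent, consequent) pairs, used as a prefactor."""
    out = (0, 0, 0)
    for ante, cons in dyadic:
        out = add(out, scale(dot(cons, rho), ante))
    return out

def postfactor(rho, dyadic):
    """The same representation, used as a postfactor."""
    out = (0, 0, 0)
    for ante, cons in dyadic:
        out = add(out, scale(dot(rho, ante), cons))
    return out

def tau(rho):
    """A hypothetical linear vector function of rho."""
    return cross((1, 2, 3), rho)

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
alpha, beta, gamma = tau(i), tau(j), tau(k)

as_prefactor = [(alpha, i), (beta, j), (gamma, k)]    # {alpha i + beta j + gamma k}
as_postfactor = [(i, alpha), (j, beta), (k, gamma)]   # {i alpha + j beta + k gamma}

rho = (4, -1, 2)
print(tau(rho))                        # (7, 10, -9)
print(prefactor(as_prefactor, rho))    # (7, 10, -9)
print(postfactor(rho, as_postfactor))  # (7, 10, -9)
```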
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111. Def—We shall say that a dyadic is multiplied by a scalar, 
when one of the vectors of each of its component dyads is multiplied 
by that scalar. It is evidently immaterial to which vector of any 
dyad the scalar factor is applied. The product of the dyadic $\Phi$ and
the scalar $a$ may be written either $a\Phi$ or $\Phi a$. The minus sign before
a dyadic reverses the signs of all its terms.

112. The sign + in a dyadic, or connecting dyadics, may be 
regarded as expressing addition, since the combination of dyads and 
dyadics with this sign is subject to the laws of association and 
commutation. 

113. The combination of vectors in a dyad is evidently distributive.
That is,

$$[\alpha+\beta+\text{etc.}][\lambda+\mu+\text{etc.}] = \alpha\lambda+\alpha\mu+\beta\lambda+\beta\mu+\text{etc.}$$


We may therefore regard the dyad as a kind of product of the two 
vectors of which it is formed. Since this kind of product is not 
commutative, we shall have occasion to distinguish the factors as 
antecedent and consequent. 

114. Since any vector may be expressed as a sum of $i$, $j$, and $k$ with
scalar coefficients, every dyadic may be reduced to a sum of the nine
dyads $ii$, $ij$, $ik$, $ji$, $jj$, $jk$, $ki$, $kj$, $kk$,
with scalar coefficients. Two such sums cannot be equal according to 
the definitions of No. 108, unless their coefficients are equal each to 
each. Hence dyadics are equal only when their equality can be 
deduced from the principle that the operation of forming a dyad is a 
distributive one. 

On this account, we may regard the dyad as the most general form 
of product of two vectors. We shall call it the indeterminate product. 
The complete determination of a single dyad involves five independent 
scalars, of a dyadic, nine. 

115. It follows from the principles of the last paragraph that if

$$\Sigma\,\alpha\beta = \Sigma\,\kappa\lambda,$$

then

$$\Sigma\,\alpha\times\beta = \Sigma\,\kappa\times\lambda,$$

and

$$\Sigma\,\alpha\cdot\beta = \Sigma\,\kappa\cdot\lambda.$$

In other words, the vector and the scalar obtained from a dyadic
by insertion of the sign of skew or direct multiplication in each dyad
are both independent of the particular form in which the dyadic is
expressed.

We shall write $\Phi_\times$ and $\Phi_S$ to indicate the vector and the scalar thus
obtained,

$$\Phi_\times = (j\cdot\Phi\cdot k - k\cdot\Phi\cdot j)\,i + (k\cdot\Phi\cdot i - i\cdot\Phi\cdot k)\,j + (i\cdot\Phi\cdot j - j\cdot\Phi\cdot i)\,k,$$
$$\Phi_S = i\cdot\Phi\cdot i + j\cdot\Phi\cdot j + k\cdot\Phi\cdot k,$$

as is at once evident, if we suppose $\Phi$ to be expanded in terms of
$ii$, $ij$, etc.
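The independence asserted in No. 115 can be illustrated by writing one dyadic in two forms and computing $\Phi_\times$ and $\Phi_S$ from each, and also from the nine coefficients of No. 114. In the Python sketch below the list-of-pairs representation, the function names, and the sample vectors are assumptions of this illustration; the array `m[r][c]` holds the coefficient of the dyad formed from the `r`-th and `c`-th of $i$, $j$, $k$.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def vector_of(dyadic):
    """Phi_x: the sum of the skew products of the factors of each dyad."""
    out = (0, 0, 0)
    for ante, cons in dyadic:
        out = add(out, cross(ante, cons))
    return out

def scalar_of(dyadic):
    """Phi_S: the sum of the direct products of the factors of each dyad."""
    return sum(dot(ante, cons) for ante, cons in dyadic)

def nonion(dyadic):
    """Coefficients of ii, ij, ..., kk as a 3x3 array."""
    m = [[0, 0, 0] for _ in range(3)]
    for ante, cons in dyadic:
        for r in range(3):
            for c in range(3):
                m[r][c] += ante[r] * cons[c]
    return m

alpha, lam, mu = (1, 2, 3), (0, 1, 4), (2, -1, 1)
form_a = [(alpha, add(lam, mu))]        # alpha [lambda + mu]
form_b = [(alpha, lam), (alpha, mu)]    # alpha lambda + alpha mu

print(vector_of(form_a), vector_of(form_b))   # (10, 1, -4) (10, 1, -4)
print(scalar_of(form_a), scalar_of(form_b))   # 17 17

m = nonion(form_b)
from_coefficients = (m[1][2] - m[2][1], m[2][0] - m[0][2], m[0][1] - m[1][0])
print(from_coefficients, m[0][0] + m[1][1] + m[2][2])   # (10, 1, -4) 17
```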

116. Def.—The (direct) product of two dyads (indicated by a dot)
is the dyad formed of the first and last of the four factors, multiplied
by the direct product of the second and third. That is,

$$\{\alpha\beta\}\cdot\{\gamma\delta\} = \alpha\,\beta\cdot\gamma\,\delta = \beta\cdot\gamma\;\alpha\delta.$$

The (direct) product of two dyadics is the sum of all the products
formed by prefixing a term of the first dyadic to a term of the second.
Since the direct product of one dyadic with another is a dyadic, it
may be multiplied in the same way by a third, and so on indefinitely.
This kind of multiplication is evidently associative, as well as
distributive. The same is true of the direct product of a series of factors
of which the first and the last are either dyadics or vectors, and the
other factors are dyadics. Thus the values of the expressions

$$\alpha\cdot\Phi\cdot\Theta\cdot\Psi\cdot\beta, \qquad \alpha\cdot\Phi\cdot\Theta, \qquad \Phi\cdot\Theta\cdot\Psi\cdot\beta, \qquad \Phi\cdot\Theta\cdot\Psi$$

will not be affected by any insertion of parentheses. But this kind of
multiplication is not commutative, except in the case of the direct
product of two vectors.
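The rule for the product of two dyads, and the associative but non-commutative character of the multiplication, can be verified with the nine-coefficient form of a dyadic. The Python sketch below assumes (as a modern reading, not stated in the text) that a dyad $\alpha\beta$ corresponds to the 3x3 array of products of the components of $\alpha$ and $\beta$, and that the direct product of dyadics then corresponds to the ordinary row-by-column product of arrays; the sample vectors are arbitrary.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def outer(a, b):
    """Nine coefficients of the dyad a b."""
    return [[x * y for y in b] for x in a]

def matmul(p, q):
    """Row-by-column product of two 3x3 arrays."""
    return [[sum(p[r][n] * q[n][c] for n in range(3)) for c in range(3)]
            for r in range(3)]

def scale(s, m):
    return [[s * x for x in row] for row in m]

a, b, c, d = (1, 0, 2), (0, 1, 1), (3, 1, 0), (1, 1, 1)

# {a b}.{c d} = (b.c) a d
lhs = matmul(outer(a, b), outer(c, d))
rhs = scale(dot(b, c), outer(a, d))
print(lhs == rhs)   # True

# Associative: {P.Q}.T = P.{Q.T}
p, q = outer(a, b), outer(c, d)
t = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]
print(matmul(matmul(p, q), t) == matmul(p, matmul(q, t)))   # True

# Not commutative: P.Q differs from Q.P for these values
print(matmul(p, q) == matmul(q, p))   # False
```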
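117. Def.—The expressions $\Phi\times\rho$ and $\rho\times\Phi$ represent dyadics which
we shall call the skew products of $\Phi$ and $\rho$. If

$$\Phi = \alpha\lambda + \beta\mu + \text{etc.},$$

these skew products are defined by the equations

$$\Phi\times\rho = \alpha\,\lambda\times\rho + \beta\,\mu\times\rho + \text{etc.},$$
$$\rho\times\Phi = \rho\times\alpha\,\lambda + \rho\times\beta\,\mu + \text{etc.}$$

It is evident that

$$\{\rho\times\Phi\}\cdot\Psi = \rho\times\{\Phi\cdot\Psi\}, \qquad \Psi\cdot\{\Phi\times\rho\} = \{\Psi\cdot\Phi\}\times\rho,$$
$$\{\rho\times\Phi\}\cdot\alpha = \rho\times[\Phi\cdot\alpha], \qquad \alpha\cdot\{\Phi\times\rho\} = [\alpha\cdot\Phi]\times\rho,$$
$$\{\rho\times\Phi\}\times\alpha = \rho\times\{\Phi\times\alpha\}.$$

We may therefore write without ambiguity

$$\rho\times\Phi\cdot\Psi, \qquad \Psi\cdot\Phi\times\rho, \qquad \rho\times\Phi\cdot\alpha, \qquad \alpha\cdot\Phi\times\rho, \qquad \rho\times\Phi\times\alpha.$$

This may be expressed a little more generally by saying that the
associative principle enunciated in No. 116 may be extended to cases
in which the initial or final vectors are connected with the other
factors by the sign of skew multiplication.

Moreover,

$$\alpha\cdot\rho\times\Phi = [\alpha\times\rho]\cdot\Phi \quad\text{and}\quad \Phi\times\rho\cdot\alpha = \Phi\cdot[\rho\times\alpha].$$

These expressions evidently represent vectors. So

$$\Psi\cdot\{\rho\times\Phi\} = \{\Psi\times\rho\}\cdot\Phi.$$

These expressions represent dyadics. The braces cannot be omitted
without ambiguity.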
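The two vector identities of No. 117 can be checked directly from the definitions of the skew products. In the Python sketch below, a dyadic is again represented (by assumption) as a list of (antecedent, consequent) pairs, and the names `skew_right` and `skew_left` for $\Phi\times\rho$ and $\rho\times\Phi$ are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def scale(c, a):
    return tuple(c * x for x in a)

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def prefactor(dyadic, rho):
    out = (0, 0, 0)
    for ante, cons in dyadic:
        out = add(out, scale(dot(cons, rho), ante))
    return out

def postfactor(rho, dyadic):
    out = (0, 0, 0)
    for ante, cons in dyadic:
        out = add(out, scale(dot(rho, ante), cons))
    return out

def skew_right(dyadic, rho):
    """Phi x rho: each consequent is multiplied (skew) by rho."""
    return [(ante, cross(cons, rho)) for ante, cons in dyadic]

def skew_left(rho, dyadic):
    """rho x Phi: rho is multiplied (skew) into each antecedent."""
    return [(cross(rho, ante), cons) for ante, cons in dyadic]

phi = [((1, 0, 2), (0, 1, 1)), ((3, 1, 0), (1, 1, 1))]
alpha = (2, -1, 1)
rho = (1, 4, -2)

# alpha . rho x Phi = [alpha x rho] . Phi
print(postfactor(alpha, skew_left(rho, phi)) == postfactor(cross(alpha, rho), phi))   # True
# Phi x rho . alpha = Phi . [rho x alpha]
print(prefactor(skew_right(phi, rho), alpha) == prefactor(phi, cross(rho, alpha)))    # True
```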
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118. Since all the antecedents or all the consequents in any dyadic 
may be expressed in parts of any three non-complanar vectors, and 
since the sum of any number of dyads having the same antecedent 
or the same consequent may be expressed by a single dyad, it follows 
that any dyadic may be expressed as the sum of three dyads, and so, 
that either the antecedents or the consequents shall be any desired 
non-complanar vectors, but only in one way when either the ante- 
cedents or the consequents are thus given. 

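In particular, the dyadic

$$\begin{aligned} &a\,ii + b\,ij + c\,ik \\ +\;&a'\,ji + b'\,jj + c'\,jk \\ +\;&a''\,ki + b''\,kj + c''\,kk, \end{aligned}$$

which may for brevity be written

$$\left\{\begin{matrix} a & b & c \\ a' & b' & c' \\ a'' & b'' & c'' \end{matrix}\right\},$$

is equal to $\alpha i+\beta j+\gamma k$,
where

$$\alpha = a\,i + a'j + a''k,$$
$$\beta = b\,i + b'j + b''k,$$
$$\gamma = c\,i + c'j + c''k,$$

and to $i\lambda+j\mu+k\nu$,
where

$$\lambda = a\,i + b\,j + c\,k,$$
$$\mu = a'i + b'j + c'k,$$
$$\nu = a''i + b''j + c''k.$$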
119. By a similar process, the sum of three dyads may be reduced
to the sum of two dyads, whenever either the antecedents or the
consequents are complanar, and only in such cases. To prove the
latter point, let us suppose that in the dyadic

$$\alpha\lambda + \beta\mu + \gamma\nu$$

neither the antecedents nor the consequents are complanar. The
vector

$$\{\alpha\lambda+\beta\mu+\gamma\nu\}\cdot\rho$$

is a linear function of $\rho$ which will be parallel to $\alpha$ when $\rho$ is
perpendicular to $\mu$ and $\nu$, which will be parallel to $\beta$ when $\rho$ is perpendicular
to $\nu$ and $\lambda$, and which will be parallel to $\gamma$ when $\rho$ is perpendicular to
$\lambda$ and $\mu$. Hence, the function may be given any value whatever by
giving the proper value to $\rho$. This would evidently not be the case
with the sum of two dyads. Hence, by No. 108, this dyadic cannot be
equal to the sum of two dyads.


120. In like manner, the sum of two dyads may be reduced to a
single dyad, if either the antecedents or the consequents are parallel, 
and only in such cases. 

A sum of three dyads cannot be reduced to a single dyad, unless 
either their antecedents or consequents are parallel, or both ante- 
cedents and consequents are (separately) complanar. In the first case 
the reduction can always be made, in the second, occasionally. 

121. Def—A dyadic which cannot be reduced to the sum of less 
than three dyads will be called complete. 

A dyadic which can be reduced to the sum of two dyads will be
called planar. When the plane of the antecedents coincides with
that of the consequents, the dyadic will be called uniplanar. These
planes are invariable for a given dyadic, although the dyadic may be
so expressed that either the two antecedents or the two consequents
may have any desired values (which are not parallel) within their
planes.

A dyadic which can be reduced to a single dyad will be called
linear. When the antecedent and consequent are parallel, it will be
called unilinear.

A dyadic is said to have the value zero when all its terms vanish. 

122. If we set $\sigma=\Phi\cdot\rho$, $\tau=\rho\cdot\Phi$,
and give $\rho$ all possible values, $\sigma$ and $\tau$ will receive all possible values,
if $\Phi$ is complete. The values of $\sigma$ and $\tau$ will be confined each to a
plane if $\Phi$ is planar, which planes will coincide if $\Phi$ is uniplanar.
The values of $\sigma$ and $\tau$ will be confined each to a line if $\Phi$ is linear,
which lines will coincide if $\Phi$ is unilinear.
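The classification of Nos. 121 and 122 can be sketched computationally. The Python sketch below rests on an interpretation that is not stated in the text: it takes the nine coefficients of the dyadic as a 3x3 array and assumes that complete, planar, and linear dyadics correspond to arrays of rank 3, 2, and 1 respectively, the rank being the dimension of the set of values of $\Phi\cdot\rho$. The function names are hypothetical, and exact fractions are used so that no rounding affects the rank.

```python
from fractions import Fraction

def nonion(dyadic):
    """Nine coefficients of a dyadic given as (antecedent, consequent) pairs."""
    m = [[Fraction(0)] * 3 for _ in range(3)]
    for ante, cons in dyadic:
        for r in range(3):
            for c in range(3):
                m[r][c] += Fraction(ante[r]) * Fraction(cons[c])
    return m

def rank(m):
    """Rank of a 3x3 array by elimination with exact arithmetic."""
    rows = [list(row) for row in m]
    found = 0
    for col in range(3):
        pivot = next((r for r in range(found, 3) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[found], rows[pivot] = rows[pivot], rows[found]
        for r in range(found + 1, 3):
            factor = rows[r][col] / rows[found][col]
            rows[r] = [x - factor * y for x, y in zip(rows[r], rows[found])]
        found += 1
        if found == 3:
            break
    return found

KINDS = {3: "complete", 2: "planar", 1: "linear", 0: "zero"}

def classify(dyadic):
    return KINDS[rank(nonion(dyadic))]

i, j, k = (1, 0, 0), (0, 1, 0), (0, 0, 1)
print(classify([(i, i), (j, j), (k, k)]))   # complete
print(classify([(i, j), (j, k)]))           # planar
print(classify([(i, j), (i, k)]))           # linear
```

The last example is a sum of two dyads with the same antecedent, which by No. 120 reduces to the single dyad $i[j+k]$.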

123. The products of complete dyadics are complete, of complete 
and planar dyadics are planar, of complete and linear dyadics are 
linear. 

The products of planar dyadics are planar, except that when the 
plane of the consequents of the first dyadic is perpendicular to the 
plane of the antecedents of the second dyadic, the product reduces 
to a linear dyadic. 

The products of linear dyadics are linear, except that when the 
consequent of the first is perpendicular to the antecedent of the 
second, the product reduces to zero. 

The products of planar and linear dyadics are linear, except when, 
the planar preceding, the plane of its consequents is perpendicular to 
the antecedent of the linear, or, the linear preceding, its consequent is 
perpendicular to the plane of the antecedents of the planar. In these 
cases the product is zero. 

All these cases are readily proved, if we set

$$\sigma = \Phi\cdot\Psi\cdot\rho,$$

and consider the limits within which $\sigma$ varies, when we give $\rho$ all
possible values.

The products $\Psi\times\rho$ and $\rho\times\Phi$ are evidently planar dyadics.

124. Def.—A dyadic $\Phi$ is said to be an idemfactor, when

$\Phi\cdot\rho=\rho$ for all values of $\rho$,
or when $\rho\cdot\Phi=\rho$ for all values of $\rho$.

If either of these conditions holds true, $\Phi$ must be reducible to the
form

$$ii+jj+kk.$$

Therefore, both conditions will hold, if either does. All such dyadics
are equal, by No. 108. They will be represented by the letter I.

The direct product of an idemfactor with another dyadic is equal to
that dyadic. That is,

$$\mathrm{I}\cdot\Phi = \Phi, \qquad \Phi\cdot\mathrm{I} = \Phi,$$

where $\Phi$ is any dyadic.

A dyadic of the form

$$\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

in which $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$, is an idemfactor.
(See No. 38.) A dyadic trinomial cannot be an idemfactor, unless its
antecedents and consequents are reciprocals.

125. If one of the direct products of two dyadics is an idemfactor,
the other is also. For, if $\Phi\cdot\Psi=\mathrm{I}$,

$$\sigma\cdot\Phi\cdot\Psi = \sigma$$

for all values of $\sigma$, and $\Phi$ is complete;

$$\sigma\cdot\Phi\cdot\Psi\cdot\Phi = \sigma\cdot\Phi$$

for all values of $\sigma$, therefore for all values of $\sigma\cdot\Phi$, and therefore
$\Psi\cdot\Phi=\mathrm{I}$.

Def.—In this case, either dyadic is called the reciprocal of the
other.

It is evident that an incomplete dyadic cannot have any (finite)
reciprocal.

Reciprocals of the same dyadic are equal. For if $\Phi$ and $\Psi$ are both
reciprocals of $\Omega$, $\Phi=\Phi\cdot\Omega\cdot\Psi=\Psi$.


If two dyadics are reciprocals, the operators formed by using these 
dyadics as prefactors are inverse, also the operators formed by using 
them as postfactors. 

126. The reciprocal of any complete dyadic

$$\alpha\lambda + \beta\mu + \gamma\nu$$

is

$$\lambda'\alpha' + \mu'\beta' + \nu'\gamma',$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$, and $\lambda'$, $\mu'$, $\nu'$ are the
reciprocals of $\lambda$, $\mu$, $\nu$. (See No. 38.)

127. Def.—We shall write $\Phi^{-1}$ for the reciprocal of any (complete)
dyadic $\Phi$, also $\Phi^2$ for $\Phi\cdot\Phi$, etc., and $\Phi^{-2}$ for $\Phi^{-1}\cdot\Phi^{-1}$, etc. It is
evident that $\Phi^{-n}$ is the reciprocal of $\Phi^n$.
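The rule of No. 126 can be tried on a numerical case. The Python sketch below is an illustration under stated assumptions: the reciprocals of three non-complanar vectors are computed by the usual formula $\alpha'=\beta\times\gamma/(\alpha\cdot\beta\times\gamma)$, etc. (the system referred to in No. 38, which lies outside this passage), dyadics are held as 3x3 arrays of coefficients, the direct product is taken as the row-by-column product, and the sample vectors are arbitrary. Exact fractions are used so that the comparison with the idemfactor is exact.

```python
from fractions import Fraction

def vec(*xs):
    return tuple(Fraction(x) for x in xs)

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def reciprocals(a, b, c):
    """Reciprocal system a', b', c' of three non-complanar vectors."""
    volume = dot(a, cross(b, c))
    return tuple(tuple(x / volume for x in v)
                 for v in (cross(b, c), cross(c, a), cross(a, b)))

def nonion(dyadic):
    """Nine coefficients of a dyadic given as (antecedent, consequent) pairs."""
    m = [[Fraction(0)] * 3 for _ in range(3)]
    for ante, cons in dyadic:
        for r in range(3):
            for c in range(3):
                m[r][c] += ante[r] * cons[c]
    return m

def matmul(p, q):
    return [[sum(p[r][n] * q[n][c] for n in range(3)) for c in range(3)]
            for r in range(3)]

alpha, beta, gamma = vec(1, 1, 0), vec(0, 1, 1), vec(1, 0, 1)
lam, mu, nu = vec(2, 0, 0), vec(0, 1, 0), vec(1, 0, 3)

phi = nonion([(alpha, lam), (beta, mu), (gamma, nu)])
alpha_r, beta_r, gamma_r = reciprocals(alpha, beta, gamma)
lam_r, mu_r, nu_r = reciprocals(lam, mu, nu)
phi_inv = nonion([(lam_r, alpha_r), (mu_r, beta_r), (nu_r, gamma_r)])

idemfactor = [[Fraction(int(r == c)) for c in range(3)] for r in range(3)]
print(matmul(phi, phi_inv) == idemfactor)   # True
print(matmul(phi_inv, phi) == idemfactor)   # True
```

Both products give the idemfactor, in agreement with No. 125.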

128. In the reduction of equations, if we have 


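$$\Phi\cdot\Psi = \Phi\cdot\Omega,$$

we may cancel the $\Phi$ (which is equivalent to multiplying by $\Phi^{-1}$) if
$\Phi$ is a complete dyadic, but not otherwise. The case is the same with
such equations as

$$\Phi\cdot\sigma = \Phi\cdot\rho, \qquad \Psi\cdot\Phi = \Omega\cdot\Phi, \qquad \rho\cdot\Phi = \sigma\cdot\Phi.$$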


To cancel an incomplete dyadic in such cases would be analogous to 
cancelling a zero factor in algebra. 

129. Def.—If in any dyadic we transpose the factors in each term,
the dyadic thus formed is said to be conjugate to the first. Thus

$$\alpha\lambda+\beta\mu+\gamma\nu \quad\text{and}\quad \lambda\alpha+\mu\beta+\nu\gamma$$

are conjugate to each other. A dyadic of which the value is not
altered by such transposition is said to be self-conjugate. The
conjugate of any dyadic $\Phi$ may be written $\Phi_C$. It is evident that

$$\rho\cdot\Phi = \Phi_C\cdot\rho \quad\text{and}\quad \Phi\cdot\rho = \rho\cdot\Phi_C.$$

$\Phi_C\cdot\rho$ and $\Phi\cdot\rho$ are conjugate functions of $\rho$. (See No. 106.) Since
$\{\Phi_C\}^2=\{\Phi^2\}_C$, we may write $\Phi_C^2$, etc., without ambiguity.
130. The reciprocal of the product of any number of dyadics is
equal to the product of their reciprocals taken in inverse order. Thus

$$\{\Phi\cdot\Psi\cdot\Omega\}^{-1} = \Omega^{-1}\cdot\Psi^{-1}\cdot\Phi^{-1}.$$

The conjugate of the product of any number of dyadics is equal to
the product of their conjugates taken in inverse order. Thus

$$\{\Phi\cdot\Psi\cdot\Omega\}_C = \Omega_C\cdot\Psi_C\cdot\Phi_C.$$

Hence, since

$$\Phi_C\cdot\{\Phi^{-1}\}_C = \{\Phi^{-1}\cdot\Phi\}_C = \mathrm{I},$$
$$\{\Phi^{-1}\}_C = \{\Phi_C\}^{-1},$$

and we may write $\Phi_C^{-1}$ without ambiguity.

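131. It is sometimes convenient to be able to express by a dyadic
taken in direct multiplication the same operation which would be
effected by a given vector ($\alpha$) in skew multiplication. The dyadic
$I\times\alpha$ will answer this purpose. For, by No. 117,

$$\{I\times\alpha\}\cdot\rho = \alpha\times\rho, \qquad \rho\cdot\{I\times\alpha\} = \rho\times\alpha,$$

$$\{I\times\alpha\}\cdot\Phi = \alpha\times\Phi, \qquad \Phi\cdot\{I\times\alpha\} = \Phi\times\alpha.$$

The same is true of the dyadic $\alpha\times I$, which is indeed identical with
$I\times\alpha$, as appears from the equation $I\cdot\{\alpha\times I\} = \{I\times\alpha\}\cdot I$.

If $\alpha$ is a unit vector,

$$\{I\times\alpha\}^2 = -\{I - \alpha\alpha\},$$

$$\{I\times\alpha\}^3 = -I\times\alpha,$$

$$\{I\times\alpha\}^4 = I - \alpha\alpha,$$

$$\{I\times\alpha\}^5 = I\times\alpha,$$

etc.

If $i$, $j$, $k$ are a normal system of unit vectors,

$$I\times i = i\times I = kj - jk,$$

$$I\times j = j\times I = ik - ki,$$

$$I\times k = k\times I = ji - ij.$$

If $\alpha$ and $\beta$ are any vectors,

$$[\alpha\times\beta]\times I = I\times[\alpha\times\beta] = \beta\alpha - \alpha\beta.$$

That is, the vector $\alpha\times\beta$ as a pre- or post-factor in skew
multiplication is equivalent to the dyadic $\{\beta\alpha - \alpha\beta\}$ taken as pre- or
post-factor in direct multiplication.

$$[\alpha\times\beta]\times\rho = \{\beta\alpha - \alpha\beta\}\cdot\rho,$$

$$\rho\times[\alpha\times\beta] = \rho\cdot\{\beta\alpha - \alpha\beta\}.$$

This is essentially the theorem of No. 27, expressed in a form more
symmetrical, and more easily remembered.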
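
The statements of No. 131 can be tried on numbers. The sketch below is in Python and assumes, as a convention of the example only, that a dyadic is a 3x3 array of coefficients on i, j, k. The function `skew`, standing for the dyadic formed from I and a vector by skew multiplication, and the sample vectors are hypothetical names chosen for illustration.

```python
def cross(a, b):
    """Skew product a x b."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def skew(a):
    """Coefficients of I x a: as a prefactor it turns rho into a x rho."""
    return [[0.0, -a[2], a[1]],
            [a[2], 0.0, -a[0]],
            [-a[1], a[0], 0.0]]


def prefactor(A, v):
    """A . v, the dyadic used as prefactor."""
    return [sum(A[r][k] * v[k] for k in range(3)) for r in range(3)]


def postfactor(v, A):
    """v . A, the dyadic used as postfactor."""
    return [sum(v[k] * A[k][c] for k in range(3)) for c in range(3)]


def matmul(A, B):
    return [[sum(A[r][k] * B[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def outer(u, v):
    """The dyad uv (antecedent u, consequent v)."""
    return [[u[r] * v[c] for c in range(3)] for r in range(3)]


def vclose(u, v, tol=1e-9):
    return all(abs(u[k] - v[k]) < tol for k in range(3))


def close(A, B, tol=1e-9):
    return all(vclose(A[r], B[r], tol) for r in range(3))


ALPHA = [1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0]   # a unit vector
BETA = [2.0, -1.0, 0.5]
RHO = [0.5, -1.0, 2.0]

S = skew(ALPHA)
S2 = matmul(S, S)          # should equal -(I - alpha alpha)
S3 = matmul(S2, S)         # should equal -(I x alpha)
MINUS_PROJECTOR = [[ALPHA[r] * ALPHA[c] - (r == c) for c in range(3)] for r in range(3)]

# [alpha x beta] x I compared with beta alpha - alpha beta.
BA, AB = outer(BETA, ALPHA), outer(ALPHA, BETA)
DIFFERENCE = [[BA[r][c] - AB[r][c] for c in range(3)] for r in range(3)]
```

The cycle of powers repeats with period four only because `ALPHA` has unit length; for a vector of another length each squaring brings in the factor alpha.alpha.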
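
132. The equation

$$\alpha\,\beta\times\gamma + \beta\,\gamma\times\alpha + \gamma\,\alpha\times\beta = \alpha\cdot\beta\times\gamma\; I$$

gives, on multiplication by any vector $\rho$, the identical equation

$$\rho\cdot\alpha\;\beta\times\gamma + \rho\cdot\beta\;\gamma\times\alpha + \rho\cdot\gamma\;\alpha\times\beta = \alpha\cdot\beta\times\gamma\;\rho.$$

(See No. 37.) The former equation is therefore identically true.
(See No. 108.) It is a little more general than the equation

$$\alpha\alpha' + \beta\beta' + \gamma\gamma' = I,$$

which we have already considered (No. 124), since, in the form here
given, it is not necessary that $\alpha$, $\beta$, and $\gamma$ should be non-complanar.
We may also write

$$\beta\times\gamma\;\alpha + \gamma\times\alpha\;\beta + \alpha\times\beta\;\gamma = \alpha\cdot\beta\times\gamma\; I.$$

Multiplying this equation by $\rho$ as prefactor (or the first equation by
$\rho$ as postfactor), we obtain

$$\rho\cdot\beta\times\gamma\;\alpha + \rho\cdot\gamma\times\alpha\;\beta + \rho\cdot\alpha\times\beta\;\gamma = \alpha\cdot\beta\times\gamma\;\rho.$$

(Compare No. 37.) For three complanar vectors we have

$$\alpha\,\beta\times\gamma + \beta\,\gamma\times\alpha + \gamma\,\alpha\times\beta = 0.$$

Multiplying this by $\nu$, a unit normal to the plane of $\alpha$, $\beta$, and $\gamma$,
we have

$$\alpha\;\beta\times\gamma\cdot\nu + \beta\;\gamma\times\alpha\cdot\nu + \gamma\;\alpha\times\beta\cdot\nu = 0.$$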


This equation expresses the well-known theorem that if the geometrical 
sum of three vectors is zero, the magnitude of each vector is propor- 
tional to the sine of the angle between the other two. It also indicates 
the numerical coefficients by which one of three complanar vectors may 
be expressed in parts of the other two. 

133. Def. If two dyadics $\Phi$ and $\Psi$ are such that

$$\Phi\cdot\Psi = \Psi\cdot\Phi,$$

they are said to be homologous.

If any number of dyadics are homologous to one another, and any
other dyadics are formed from them by the operations of taking
multiples, sums, differences, powers, reciprocals, or products, such
dyadics will be homologous to each other and to the original dyadics.
This requires demonstration only in regard to reciprocals. Now if

$$\Phi\cdot\Psi = \Psi\cdot\Phi,$$

$$\Psi\cdot\Phi^{-1} = \Phi^{-1}\cdot\Phi\cdot\Psi\cdot\Phi^{-1} = \Phi^{-1}\cdot\Psi\cdot\Phi\cdot\Phi^{-1} = \Phi^{-1}\cdot\Psi.$$

That is, $\Phi^{-1}$ is homologous to $\Psi$, if $\Phi$ is.

134. If we call $\Psi\cdot\Phi^{-1}$ or $\Phi^{-1}\cdot\Psi$ the quotient of $\Psi$ and $\Phi$, we may
say that the rules of addition, subtraction, multiplication and division
of homologous dyadics are identical with those of arithmetic or
ordinary algebra, except that limitations analogous to those respecting
zero in algebra must be observed with respect to all incomplete
dyadics.

It follows that the algebraic and higher analysis of homologous 
dyadics is substantially identical with that of scalars. 

135. It is always possible to express a dyadic in three terms, so 
that both the antecedents and the consequents shall be perpendicular 
among themselves. 

To show this for any dyadic $\Phi$, let us set

$$\rho' = \Phi\cdot\rho,$$

$\rho$ being a unit-vector, and consider the different values of $\rho'$ for all
possible directions of $\rho$. Let the direction of the unit vector $i$ be so
determined that when $\rho$ coincides with $i$, the value of $\rho'$ shall be at
least as great as for any other direction of $\rho$. And let the direction
of the unit vector $j$ be so determined that when $\rho$ coincides with $j$,
the value of $\rho'$ shall be at least as great as for any other direction of $\rho$
which is perpendicular to $i$. Let $k$ have its usual position with
respect to $i$ and $j$. It is evidently possible to express $\Phi$ in the form

$$\alpha i + \beta j + \gamma k.$$

We have therefore

$$\rho' = \{\alpha i + \beta j + \gamma k\}\cdot\rho,$$

and

$$d\rho' = \{\alpha i + \beta j + \gamma k\}\cdot d\rho.$$

Now the supposed property of the direction of $i$ requires that when $\rho$
coincides with $i$ and $d\rho$ is perpendicular to $i$, $d\rho'$ shall be perpendicular
to $\rho'$, which will then be parallel to $\alpha$. But if $d\rho$ is parallel to $j$ or $k$,
it will be perpendicular to $i$, and $d\rho'$ will be parallel to $\beta$ or $\gamma$, as the
case may be. Therefore $\beta$ and $\gamma$ are perpendicular to $\alpha$. In the same
way it may be shown that the condition relative to $j$ requires that $\gamma$
shall be perpendicular to $\beta$. We may therefore set

$$\Phi = a\,i'i + b\,j'j + c\,k'k,$$

where $i'$, $j'$, $k'$, like $i$, $j$, $k$, constitute a normal system of unit vectors
(see No. 11), and $a$, $b$, $c$ are scalars which may be either positive or
negative.

It makes an important difference whether the number of these
scalars which are negative is even or odd. If two are negative, say $a$
and $b$, we may make them positive by reversing the directions of $i'$
and $j'$. The vectors $i'$, $j'$, $k'$ will still constitute a normal system.
But if we should reverse the directions of an odd number of these
vectors, they would cease to constitute a normal system, and to be
superposable upon the system $i$, $j$, $k$. We may, however, always set
either

$$\Phi = a\,i'i + b\,j'j + c\,k'k,$$

or

$$\Phi = -\{a\,i'i + b\,j'j + c\,k'k\},$$

with positive values of $a$, $b$, and $c$. At the limit between these cases
are the planar dyadics, in which one of the three terms vanishes, and
the dyadic reduces to the form

$$a\,i'i + b\,j'j,$$

in which $a$ and $b$ may always be made positive by giving the proper
directions to $i'$ and $j'$.

If the numerical values of $a$, $b$, $c$ are all unequal, there will be only
one way in which the value of $\Phi$ may be thus expressed. If they are
not all unequal, there will be an infinite number of ways in which $\Phi$
may be thus expressed, in all of which the three scalar coefficients
will have the same values with exception of the changes of signs
mentioned above. If the three values are numerically identical, we
may give to either system of normal vectors an arbitrary position.


136. It follows that any self-conjugate dyadic may be expressed
in the form

$$a\,ii + b\,jj + c\,kk,$$

where $i$, $j$, $k$ are a normal system of unit vectors, and $a$, $b$, $c$ are
positive or negative scalars.

137. Any dyadic may be divided into two parts, of which one shall
be self-conjugate, and the other of the form $I\times\alpha$. These parts are
found by taking half the sum and half the difference of the dyadic
and its conjugate. It is evident that

$$\Phi = \tfrac{1}{2}\{\Phi + \Phi_C\} + \tfrac{1}{2}\{\Phi - \Phi_C\}.$$

Now $\tfrac{1}{2}\{\Phi + \Phi_C\}$ is self-conjugate, and

$$\tfrac{1}{2}\{\Phi - \Phi_C\} = I\times[-\tfrac{1}{2}\Phi_\times].$$

(See No. 131.)
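
The division of No. 137 into a self-conjugate part and a part formed from I by skew multiplication may be illustrated as follows. This is a hedged sketch in Python; it assumes a dyadic is stored as a 3x3 array of coefficients on i, j, k, and the names `vector_of`, `split`, and `PHI` are chosen for the example and do not come from the text.

```python
def conjugate(A):
    """Conjugate dyadic (factors of each term transposed)."""
    return [[A[c][r] for c in range(3)] for r in range(3)]


def skew(a):
    """Coefficients of I x a."""
    return [[0.0, -a[2], a[1]],
            [a[2], 0.0, -a[0]],
            [-a[1], a[0], 0.0]]


def vector_of(A):
    """Vector of the dyadic: each dyad replaced by antecedent x consequent."""
    return [A[1][2] - A[2][1], A[2][0] - A[0][2], A[0][1] - A[1][0]]


def split(A):
    """Return (self-conjugate part, remaining part) of the dyadic A."""
    C = conjugate(A)
    sym = [[0.5 * (A[r][c] + C[r][c]) for c in range(3)] for r in range(3)]
    rest = [[0.5 * (A[r][c] - C[r][c]) for c in range(3)] for r in range(3)]
    return sym, rest


def close(A, B, tol=1e-12):
    return all(abs(A[r][c] - B[r][c]) < tol for r in range(3) for c in range(3))


PHI = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 10.0]]   # illustrative
SYM, REST = split(PHI)

# The vector alpha of the text: minus one half of the vector of PHI.
ALPHA = [-0.5 * x for x in vector_of(PHI)]
```

For this sample dyadic `ALPHA` comes out as `[1.0, -2.0, 1.0]`, and `REST` agrees with `skew(ALPHA)` coefficient by coefficient.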


Rotations and Strains. 


138. To illustrate the use of dyadics as operators, let us suppose
that a body receives such a displacement that

$$\rho' = \Phi\cdot\rho,$$

$\rho$ and $\rho'$ being the position-vectors of the same point of the body in
its initial and subsequent positions. The same relation will hold of
the vectors which unite any two points of the body in their initial
and subsequent positions. For if $\rho_1$, $\rho_2$ are the original position-vectors
of the points, and $\rho_1'$, $\rho_2'$ their final position-vectors, we have

$$\rho_1' = \Phi\cdot\rho_1, \qquad \rho_2' = \Phi\cdot\rho_2,$$

$$\rho_2' - \rho_1' = \Phi\cdot[\rho_2 - \rho_1].$$

In the most general case, the body is said to receive a homogeneous 
strain. In special cases, the displacement reduces to a rotation. 
Lines in the body initially straight and parallel will be straight and 
parallel after the displacement, and surfaces initially plane and 
parallel will be plane and parallel after the displacement. 

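139. The vectors ($\sigma$, $\sigma'$) which represent any plane surface in the
body in its initial and final positions will be linear functions of each
other. (This will appear, if we consider the four sides of a tetrahedron
in the body.) To find the relation of the dyadics which
express $\sigma'$ as a function of $\sigma$, and $\rho'$ as a function of $\rho$, let

$$\rho' = \{\alpha\lambda + \beta\mu + \gamma\nu\}\cdot\rho.$$

Then, if we write $\lambda'$, $\mu'$, $\nu'$ for the reciprocals of $\lambda$, $\mu$, $\nu$, the vectors
$\lambda'$, $\mu'$, $\nu'$ become by the strain $\alpha$, $\beta$, $\gamma$. Therefore the surfaces $\mu'\times\nu'$,
$\nu'\times\lambda'$, $\lambda'\times\mu'$ become $\beta\times\gamma$, $\gamma\times\alpha$, $\alpha\times\beta$. But $\mu'\times\nu'$, $\nu'\times\lambda'$, $\lambda'\times\mu'$ are the
reciprocals of $\mu\times\nu$, $\nu\times\lambda$, $\lambda\times\mu$. The relation sought is therefore

$$\sigma' = \{\beta\times\gamma\;\mu\times\nu + \gamma\times\alpha\;\nu\times\lambda + \alpha\times\beta\;\lambda\times\mu\}\cdot\sigma.$$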


140. The volume $\lambda'\cdot\mu'\times\nu'$ becomes by the strain $\alpha\cdot\beta\times\gamma$. The
unit of volume becomes therefore $(\alpha\cdot\beta\times\gamma)(\lambda\cdot\mu\times\nu)$.

Def. It follows that the scalar product of the three antecedents
multiplied by the scalar product of the three consequents of a dyadic
expressed as a trinomial is independent of the particular form in
which the dyadic is thus expressed. This quantity is the determinant
of the coefficients of the nine terms of the form

$$a\,ii + b\,ij + \text{etc.},$$

into which the dyadic may be expanded. We shall call it the
determinant of the dyadic, and shall denote it by the notation

$$|\Phi|$$

when the dyadic is expressed by a single letter.

If a dyadic is incomplete, its determinant is zero, and conversely.

The determinant of the product of any number of dyadics is
equal to the product of their determinants. The determinant of the
reciprocal of a dyadic is the reciprocal of the determinant of that
dyadic. The determinants of a dyadic and its conjugate are equal.

The relation of the surfaces $\sigma'$ and $\sigma$ may be expressed by the equation

$$\sigma' = |\Phi|\,\Phi_C^{-1}\cdot\sigma.$$ *
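
The closing relation of No. 140, that a surface vector is carried by the strain into the determinant times the reciprocal of the conjugate applied to it, admits a direct numerical trial. The following Python sketch assumes the 3x3 coefficient representation of a dyadic; the sample dyadics and the two edge vectors `U`, `V` spanning the surface are illustrative.

```python
def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def det(A):
    """Determinant of the dyadic as a triple product of coefficient rows."""
    c = cross(A[1], A[2])
    return sum(A[0][k] * c[k] for k in range(3))


def matmul(A, B):
    return [[sum(A[r][k] * B[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def prefactor(A, v):
    return [sum(A[r][k] * v[k] for k in range(3)) for r in range(3)]


def conjugate(A):
    return [[A[c][r] for c in range(3)] for r in range(3)]


def reciprocal(A):
    d = det(A)
    cols = [cross(A[1], A[2]), cross(A[2], A[0]), cross(A[0], A[1])]
    return [[cols[c][r] / d for c in range(3)] for r in range(3)]


def vclose(u, v, tol=1e-9):
    return all(abs(u[k] - v[k]) < tol for k in range(3))


PHI = [[2.0, 1.0, 0.0], [0.0, 1.0, 3.0], [1.0, 0.0, 1.0]]
PSI = [[1.0, 2.0, 0.0], [0.0, 1.0, 1.0], [2.0, 0.0, 1.0]]
U, V = [1.0, 0.0, 2.0], [0.0, 1.0, -1.0]

SIGMA = cross(U, V)                                          # surface before strain
SIGMA_STRAINED = cross(prefactor(PHI, U), prefactor(PHI, V)) # surface after strain
SIGMA_PREDICTED = [det(PHI) * x
                   for x in prefactor(reciprocal(conjugate(PHI)), SIGMA)]
```

The same helpers confirm the rules stated for determinants of products, reciprocals, and conjugates.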

141. Let us now consider the different cases of rotation and strain
as determined by the nature of the dyadic $\Phi$.

If $\Phi$ is reducible to the form

$$i'i + j'j + k'k,$$

$i$, $j$, $k$, $i'$, $j'$, $k'$ being normal systems of unit vectors (see No. 11), the
body will suffer no change of form. For if

$$\rho = xi + yj + zk,$$

we shall have

$$\rho' = xi' + yj' + zk'.$$

Conversely, if the body suffers no change of form, the operating
dyadic is reducible to the above form. In such cases, it appears from
simple geometrical considerations that the displacement of the body
may be produced by a rotation about a certain axis. A dyadic
reducible to the form

$$i'i + j'j + k'k$$

may therefore be called a versor.

142. The conjugate operator evidently produces the reverse rotation.
A versor, therefore, is the reciprocal of its conjugate.

Conversely, if a dyadic is the reciprocal of its conjugate, it is either
a versor, or a versor multiplied by $-1$. For the dyadic may be
expressed in the form

$$\alpha i + \beta j + \gamma k.$$

Its conjugate will be

$$i\alpha + j\beta + k\gamma.$$

If these are reciprocals, we have

$$\{\alpha i + \beta j + \gamma k\}\cdot\{i\alpha + j\beta + k\gamma\} = \alpha\alpha + \beta\beta + \gamma\gamma = I.$$

But this relation cannot subsist unless $\alpha$, $\beta$, $\gamma$ are reciprocals to
themselves, i.e., unless they are mutually perpendicular unit-vectors.
Therefore, they either are a normal system of unit-vectors, or will
become such if their directions are reversed. Therefore, one of the
dyadics

$$\alpha i + \beta j + \gamma k \quad\text{and}\quad -\alpha i - \beta j - \gamma k$$

is a versor.

The criterion of a versor may therefore be written

$$\Phi\cdot\Phi_C = I, \quad\text{and}\quad |\Phi| = 1.$$

For the last equation we may substitute

$$|\Phi| > 0, \quad\text{or}\quad |\Phi| \neq -1.$$

It is evident that the resultant of successive finite rotations is
obtained by multiplication of the versors.

*[See note on p. 90.]

143. If we take the axis of the rotation for the direction of $i$,
$i'$ will have the same direction, and the versor reduces to the form

$$ii + j'j + k'k,$$

in which $i$, $j$, $k$ and $i$, $j'$, $k'$ are normal systems of unit vectors.
We may set

$$j' = \cos q\; j + \sin q\; k,$$

$$k' = \cos q\; k - \sin q\; j,$$

and the versor reduces to

$$ii + \cos q\,\{jj + kk\} + \sin q\,\{kj - jk\},$$

or

$$ii + \cos q\,\{I - ii\} + \sin q\; I\times i,$$

where $q$ is the angle of rotation, measured from $j$ toward $k$, if the
versor is used as a prefactor.

144. When any versor $\Phi$ is used as a prefactor, the vector $-\Phi_\times$
will be parallel to the axis of rotation, and equal in magnitude to
twice the sine of the angle of rotation measured counter-clockwise as
seen from the direction in which the vector points. (This will appear
if we suppose $\Phi$ to be represented in the form given in the last
paragraph.) The scalar $\Phi_S$ will be equal to unity increased by twice
the cosine of the same angle. Together, $-\Phi_\times$ and $\Phi_S$ determine the
versor without ambiguity. If we set

$$\theta = \frac{-\Phi_\times}{1 + \Phi_S},$$

the magnitude of $\theta$ will be

$$\frac{2\sin q}{2 + 2\cos q} \quad\text{or}\quad \tan\tfrac{1}{2}q,$$

where $q$ is measured counter-clockwise as seen from the direction in
which $\theta$ points. This vector $\theta$, which we may call the vector
semitangent of version, determines the versor without ambiguity.
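
The quantities of Nos. 143 and 144 can be computed for a particular rotation. The sketch below, in Python, builds a versor from an axis and an angle in the form given in No. 143 and then recovers the scalar, the vector, and the vector semitangent of version. It assumes the 3x3 coefficient representation of a dyadic; the names `versor`, `scalar_of`, `vector_of`, and `semitangent`, and the sample axis and angle, are illustrative.

```python
import math


def skew(a):
    """Coefficients of I x a."""
    return [[0.0, -a[2], a[1]],
            [a[2], 0.0, -a[0]],
            [-a[1], a[0], 0.0]]


def versor(axis, q):
    """aa + cos q {I - aa} + sin q I x a, for a unit vector a and angle q."""
    S = skew(axis)
    return [[axis[r] * axis[c]
             + math.cos(q) * ((r == c) - axis[r] * axis[c])
             + math.sin(q) * S[r][c]
             for c in range(3)] for r in range(3)]


def scalar_of(A):
    """Scalar of the dyadic: each dyad replaced by antecedent . consequent."""
    return A[0][0] + A[1][1] + A[2][2]


def vector_of(A):
    """Vector of the dyadic: each dyad replaced by antecedent x consequent."""
    return [A[1][2] - A[2][1], A[2][0] - A[0][2], A[0][1] - A[1][0]]


def semitangent(A):
    """Vector semitangent of version; fails for a rotation of two right angles."""
    denom = 1.0 + scalar_of(A)
    return [-x / denom for x in vector_of(A)]


AXIS = [1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0]   # unit vector
Q = 0.8                                    # angle in radians
PHI = versor(AXIS, Q)
THETA = semitangent(PHI)
```

When the rotation is through two right angles the scalar equals minus one, the denominator vanishes, and the semitangent becomes infinite; the sketch does not guard against this case.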

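145. The versor $\Phi$ may be expressed in terms of $\theta$ in various ways.
Since $\Phi$ (as prefactor) changes $\alpha - \theta\times\alpha$ into $\alpha + \theta\times\alpha$ ($\alpha$ being any
vector), we have

$$\Phi = \{I + I\times\theta\}\cdot\{I - I\times\theta\}^{-1}.$$

Again

$$\Phi = \frac{\theta\theta + \{I + I\times\theta\}^2}{1 + \theta\cdot\theta} = \frac{(1 - \theta\cdot\theta)I + 2\theta\theta + 2I\times\theta}{1 + \theta\cdot\theta},$$

as will be evident on considering separately in the expression $\Phi\cdot\rho$
the components perpendicular and parallel to $\theta$, or on substituting in

$$ii + \cos q\,(jj + kk) + \sin q\,(kj - jk)$$

for $\cos q$ and $\sin q$ their values in terms of $\tan\tfrac{1}{2}q$.

If we set, in either of these equations,

$$\theta = ai + bj + ck,$$

we obtain, on reduction, the formula

$$\Phi = \frac{1}{1 + a^2 + b^2 + c^2}\left\{\begin{aligned}
&(1 + a^2 - b^2 - c^2)\,ii + (2ab - 2c)\,ij + (2ac + 2b)\,ik \\
&+ (2ab + 2c)\,ji + (1 - a^2 + b^2 - c^2)\,jj + (2bc - 2a)\,jk \\
&+ (2ac - 2b)\,ki + (2bc + 2a)\,kj + (1 - a^2 - b^2 + c^2)\,kk
\end{aligned}\right\},$$

in which the versor is expressed in terms of the rectangular components
of the vector semitangent of version.

146. If $\alpha$, $\beta$, $\gamma$ are unit vectors, expressions of the form

$$2\alpha\alpha - I, \qquad 2\beta\beta - I, \qquad 2\gamma\gamma - I$$

are biquadrantal versors. A product like

$$\{2\beta\beta - I\}\cdot\{2\alpha\alpha - I\}$$

is a versor of which the axis is perpendicular to $\alpha$ and $\beta$, and the
amount of rotation twice that which would carry $\alpha$ to $\beta$. It is
evident that any versor may be thus expressed, and that either $\alpha$ or $\beta$
may be given any direction perpendicular to the axis of rotation. If

$$\Phi = \{2\beta\beta - I\}\cdot\{2\alpha\alpha - I\}, \quad\text{and}\quad \Psi = \{2\gamma\gamma - I\}\cdot\{2\beta\beta - I\},$$

we have for the resultant of the successive rotations

$$\Psi\cdot\Phi = \{2\gamma\gamma - I\}\cdot\{2\alpha\alpha - I\}.$$

This may be applied to the composition of any two successive
rotations, $\beta$ being taken perpendicular to the two axes of rotation,
and affords the means of determining the resultant rotation by
construction on the surface of a sphere. It also furnishes a simple
method of finding the relations of the vector semitangents of version
for the versors $\Phi$, $\Psi$, and $\Psi\cdot\Phi$. Let

$$\theta_1 = \frac{-\Phi_\times}{1 + \Phi_S}, \qquad \theta_2 = \frac{-\Psi_\times}{1 + \Psi_S}, \qquad \theta_3 = \frac{-\{\Psi\cdot\Phi\}_\times}{1 + \{\Psi\cdot\Phi\}_S}.$$

Then, since

$$\Phi = 4\,\alpha\cdot\beta\;\beta\alpha - 2\alpha\alpha - 2\beta\beta + I,$$

$$\theta_1 = -\frac{4\,\alpha\cdot\beta\;\beta\times\alpha}{4(\alpha\cdot\beta)^2} = \frac{\alpha\times\beta}{\alpha\cdot\beta},$$

which is moreover geometrically evident. In like manner,

$$\theta_2 = \frac{\beta\times\gamma}{\beta\cdot\gamma}, \qquad \theta_3 = \frac{\alpha\times\gamma}{\alpha\cdot\gamma}.$$

Therefore,

$$\theta_1\times\theta_2 = \frac{[\alpha\times\beta]\times[\beta\times\gamma]}{\alpha\cdot\beta\;\beta\cdot\gamma} = \frac{\alpha\times\beta\cdot\gamma\;\beta}{\alpha\cdot\beta\;\beta\cdot\gamma} = \frac{\beta\cdot\alpha\;\beta\times\gamma + \beta\cdot\beta\;\gamma\times\alpha + \beta\cdot\gamma\;\alpha\times\beta}{\alpha\cdot\beta\;\beta\cdot\gamma}.$$

(See No. 38.) That is,

$$\theta_1\times\theta_2 = \theta_2 + \frac{\gamma\times\alpha}{\alpha\cdot\beta\;\beta\cdot\gamma} + \theta_1.$$

Also,

$$\theta_1\cdot\theta_2 = \frac{\alpha\times\beta\cdot\beta\times\gamma}{\alpha\cdot\beta\;\beta\cdot\gamma} = \frac{\alpha\cdot\beta\;\beta\cdot\gamma - \alpha\cdot\gamma}{\alpha\cdot\beta\;\beta\cdot\gamma}.$$

Hence,

$$\theta_1\times\theta_2 = \theta_2 - (1 - \theta_1\cdot\theta_2)\,\theta_3 + \theta_1,$$

$$\theta_3 = \frac{\theta_1 + \theta_2 + \theta_2\times\theta_1}{1 - \theta_1\cdot\theta_2},$$

which is the formula for the composition of successive finite rotations
by means of their vector semitangents of version.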
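
The expression of a versor by its vector semitangent of version (No. 145) and the rule of composition just obtained (No. 146) may be tested against one another. The Python sketch below assumes a dyadic is held as a 3x3 array of coefficients; the function names and the two sample semitangents are illustrative. Here `T1` belongs to the rotation applied first and `T2` to the rotation applied second, so that the resultant versor is the product with the second versor as prefactor.

```python
def dot(a, b):
    return sum(a[k] * b[k] for k in range(3))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def skew(a):
    """Coefficients of I x a."""
    return [[0.0, -a[2], a[1]],
            [a[2], 0.0, -a[0]],
            [-a[1], a[0], 0.0]]


def matmul(A, B):
    return [[sum(A[r][k] * B[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def versor_from_semitangent(t):
    """((1 - t.t) I + 2 tt + 2 I x t) / (1 + t.t)."""
    tt = dot(t, t)
    S = skew(t)
    return [[((1.0 - tt) * (r == c) + 2.0 * t[r] * t[c] + 2.0 * S[r][c]) / (1.0 + tt)
             for c in range(3)] for r in range(3)]


def compose(t1, t2):
    """Semitangent of the resultant: rotation t1 first, then rotation t2."""
    c = cross(t2, t1)
    denom = 1.0 - dot(t1, t2)
    return [(t1[k] + t2[k] + c[k]) / denom for k in range(3)]


def close(A, B, tol=1e-9):
    return all(abs(A[r][c] - B[r][c]) < tol for r in range(3) for c in range(3))


T1 = [0.2, -0.1, 0.4]
T2 = [-0.3, 0.5, 0.1]
T3 = compose(T1, T2)

PHI = versor_from_semitangent(T1)
PSI = versor_from_semitangent(T2)
RESULTANT = matmul(PSI, PHI)
```

The denominator in `compose` vanishes when the resultant rotation is through two right angles, in which case the semitangent is infinite and this sketch does not apply.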

147. The versors just described constitute a particular class under
the more general form

$$\alpha\alpha' + \cos q\,\{\beta\beta' + \gamma\gamma'\} + \sin q\,\{\gamma\beta' - \beta\gamma'\},$$

in which $\alpha$, $\beta$, $\gamma$ are any non-complanar vectors, and $\alpha'$, $\beta'$, $\gamma'$ their
reciprocals. A dyadic of this form as a prefactor does not affect any
vector parallel to $\alpha$. Its effect on a vector in the $\beta$-$\gamma$ plane will be
best understood if we imagine an ellipse to be described of which
$\beta$ and $\gamma$ are conjugate semi-diameters. If the vector to be operated
on be a radius of this ellipse, we may evidently regard the ellipse
with $\beta$, $\gamma$, and the other vector, as the projections of a circle with two
perpendicular radii and one other radius. A little consideration will
show that if the third radius of the circle is advanced an angle $q$, its
projection in the ellipse will be advanced as required by the dyadic
prefactor. The effect, therefore, of such a prefactor on a vector in the
$\beta$-$\gamma$ plane may be obtained as follows: Describe an ellipse of which
$\beta$ and $\gamma$ are conjugate semi-diameters. Then describe a similar and
similarly placed ellipse of which the vector to be operated on is a
radius. The effect of the operator is to advance the radius in this
ellipse, in the angular direction from $\beta$ toward $\gamma$, over a segment
which is to the total area of the ellipse as $q$ is to $2\pi$. When used as
a postfactor, the properties of the dyadic are similar, but the axis of
no motion and the planes of rotation are in general different.

Def. Such dyadics we shall call cyclic.

The $N$th power ($N$ being any whole number) of such a dyadic is
obtained by multiplying $q$ by $N$. If $q$ is of the form $2\pi N/M$ ($N$ and
$M$ being any whole numbers) the $M$th power of the dyadic will be an
idemfactor. A cyclic dyadic, therefore, may be regarded as a root of
$I$, or at least capable of expression with any required degree of
accuracy as a root of $I$.

It should be observed that the value of the above dyadic will not
be altered by the substitution for $\alpha$ of any other parallel vector, or
for $\beta$ and $\gamma$ of any other conjugate semi-diameters (which succeed one
another in the same angular direction) of the same or any similar and
similarly situated ellipse, with the changes which these substitutions
require in the values of $\alpha'$, $\beta'$, $\gamma'$. Or, to consider the same changes
from another point of view, the value of the dyadic will not be altered
by the substitution for $\alpha'$ of any other parallel vector or for $\beta'$ and $\gamma'$
of any other conjugate semi-diameters (which succeed one another in
the same angular direction) of the same or any similar and similarly
situated ellipse, with the changes which these substitutions require in
the values of $\alpha$, $\beta$, and $\gamma$, defined as reciprocals of $\alpha'$, $\beta'$, $\gamma'$.

148. The strain represented by the equation

$$\rho' = \{a\,ii + b\,jj + c\,kk\}\cdot\rho,$$

where $a$, $b$, $c$ are positive scalars, may be described as consisting of
three elongations (or contractions) parallel to the axes $i$, $j$, $k$, which
are called the principal axes of the strain, and which have the property
that their directions are not affected by the strain. The scalars
$a$, $b$, $c$ are called the principal ratios of elongation. (When one of
these is less than unity, it represents a contraction.) The order of
the three elongations is immaterial, since the original dyadic is equal
to the product of the three dyadics

$$a\,ii + jj + kk, \qquad ii + b\,jj + kk, \qquad ii + jj + c\,kk$$

taken in any order.

Def. A dyadic which is reducible to this form we shall call a right
tensor. The displacement represented by a right tensor is called a
pure strain. A right tensor is evidently self-conjugate.

149. We have seen (No. 135) that every dyadic may be expressed
in the form

$$\pm\{a\,i'i + b\,j'j + c\,k'k\},$$

where $a$, $b$, $c$ are positive scalars. This is equivalent to

$$\pm\{a\,i'i' + b\,j'j' + c\,k'k'\}\cdot\{i'i + j'j + k'k\}$$

and to

$$\pm\{i'i + j'j + k'k\}\cdot\{a\,ii + b\,jj + c\,kk\}.$$

Hence every dyadic may be expressed as the product of a versor and
a right tensor with the scalar factor $\pm 1$. The versor may precede or
follow. It will be the same versor in either case, and the ratios of
elongation will be the same; but the position of the principal axes
of the tensor will differ in the two cases, either system being derived
from the other by multiplication by the versor.

Def. The displacement represented by the equation

$$\rho' = -\rho$$

is called inversion. The most general case of a homogeneous strain
may therefore be produced by a pure strain and a rotation with or
without inversion.

150. If

$$\Phi = a\,i'i + b\,j'j + c\,k'k,$$

$$\Phi\cdot\Phi_C = a^2\,i'i' + b^2\,j'j' + c^2\,k'k',$$

and

$$\Phi_C\cdot\Phi = a^2\,ii + b^2\,jj + c^2\,kk.$$

The general problem of the determination of the principal ratios and
axes of strain for a given dyadic may thus be reduced to the case of
a right tensor.

151. Def. The effect of a prefactor of the form

$$a\,\alpha\alpha' + b\,\beta\beta' + c\,\gamma\gamma',$$

where $a$, $b$, $c$ are positive or negative scalars, $\alpha$, $\beta$, $\gamma$ non-complanar
vectors, and $\alpha'$, $\beta'$, $\gamma'$ their reciprocals, is to change $\alpha$ into $a\alpha$, $\beta$ into
$b\beta$, and $\gamma$ into $c\gamma$. As a postfactor, the same dyadic will change $\alpha'$
into $a\alpha'$, $\beta'$ into $b\beta'$, and $\gamma'$ into $c\gamma'$. Dyadics which can be reduced to
this form we shall call tonic (Gr. τείνω). The right tensor already
described constitutes a particular case, distinguished by perpendicular
axes and positive values of the coefficients $a$, $b$, $c$.

The value of the dyadic is evidently not affected by substituting
vectors of different lengths but the same or opposite directions for
$\alpha$, $\beta$, $\gamma$, with the necessary changes in the values of $\alpha'$, $\beta'$, $\gamma'$, defined
as reciprocals of $\alpha$, $\beta$, $\gamma$. But, except this change, if $a$, $b$, $c$ are
unequal, the dyadic can be expressed only in one way in the above
form. If, however, two of these coefficients are equal, say $a$ and $b$,
any two non-collinear vectors in the $\alpha$-$\beta$ plane may be substituted
for $\alpha$ and $\beta$, or, if the three coefficients are equal, any three
non-complanar vectors may be substituted for $\alpha$, $\beta$, $\gamma$.

152. Tonics having the same axes (determined by the directions
of $\alpha$, $\beta$, $\gamma$) are homologous, and their multiplication is effected by
multiplying their coefficients. Thus,

$$\{a_1\alpha\alpha' + b_1\beta\beta' + c_1\gamma\gamma'\}\cdot\{a_2\alpha\alpha' + b_2\beta\beta' + c_2\gamma\gamma'\} = \{a_1a_2\,\alpha\alpha' + b_1b_2\,\beta\beta' + c_1c_2\,\gamma\gamma'\}.$$

Hence, division of such dyadics is effected by division of their
coefficients. A tonic of which the three coefficients $a$, $b$, $c$ are unequal,
is homologous only with such dyadics as can be obtained by varying
the coefficients.

153. The effect of a prefactor of the form

$$a\,\alpha\alpha' + b\,\{\beta\beta' + \gamma\gamma'\} + c\,\{\gamma\beta' - \beta\gamma'\},$$

or

$$a\,\alpha\alpha' + p\cos q\,\{\beta\beta' + \gamma\gamma'\} + p\sin q\,\{\gamma\beta' - \beta\gamma'\},$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$, and $a$, $b$, $c$, $p$, and $q$ are
scalars, of which $p$ is positive, will be most evident if we resolve it
into the factors

$$a\,\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

$$\alpha\alpha' + p\,\beta\beta' + p\,\gamma\gamma',$$

$$\alpha\alpha' + \cos q\,\{\beta\beta' + \gamma\gamma'\} + \sin q\,\{\gamma\beta' - \beta\gamma'\},$$

of which the order is immaterial, and if we suppose the vector on
which we operate to be resolved into two components, one parallel to $\alpha$,
and the other in the $\beta$-$\gamma$ plane. The effect of the first factor is to
multiply by $a$ the component parallel to $\alpha$, without affecting the other.
The effect of the second is to multiply by $p$ the component in the
$\beta$-$\gamma$ plane without affecting the other. The effect of the third is to
give the component in the $\beta$-$\gamma$ plane the kind of elliptic rotation
described in No. 147.

The effect of the same dyadic as a postfactor is of the same nature.

The value of the dyadic is not affected by the substitution for $\alpha$ of
another vector having the same direction, nor by the substitution for
$\beta$ and $\gamma$ of two other conjugate semi-diameters of the same or a
similar and similarly situated ellipse, and which follow one another
in the same angular direction.

Def. Such dyadics we shall call cyclotonic.

154. Cyclotonics which are reducible to the same form except
with respect to the values of $a$, $p$, and $q$ are homologous. They are
multiplied by multiplying the values of $a$, and also those of $p$, and
adding those of $q$. Thus, the product of

$$a_1\alpha\alpha' + p_1\cos q_1\,\{\beta\beta' + \gamma\gamma'\} + p_1\sin q_1\,\{\gamma\beta' - \beta\gamma'\}$$

and

$$a_2\alpha\alpha' + p_2\cos q_2\,\{\beta\beta' + \gamma\gamma'\} + p_2\sin q_2\,\{\gamma\beta' - \beta\gamma'\}$$

is

$$a_1a_2\,\alpha\alpha' + p_1p_2\cos(q_1 + q_2)\,\{\beta\beta' + \gamma\gamma'\} + p_1p_2\sin(q_1 + q_2)\,\{\gamma\beta' - \beta\gamma'\}.$$

A dyadic of this form, in which the value of $q$ is not zero, or the
product of $\pi$ and a positive or negative integer, is homologous only
with such dyadics as are obtained by varying the values of $a$, $p$, and $q$.
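
The rule of No. 154, that cyclotonics on the same axes are multiplied by multiplying the values of a and of p and adding those of q, can be verified on an oblique system. The Python sketch below assumes the 3x3 coefficient representation; the function `cyclotonic`, the helper `reciprocal_system`, and the three non-complanar vectors are illustrative choices.

```python
import math


def dot(a, b):
    return sum(a[k] * b[k] for k in range(3))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def outer(u, v):
    """The dyad uv."""
    return [[u[r] * v[c] for c in range(3)] for r in range(3)]


def matmul(A, B):
    return [[sum(A[r][k] * B[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def reciprocal_system(al, be, ga):
    """Reciprocals of three non-complanar vectors (No. 38)."""
    vol = dot(al, cross(be, ga))
    return ([x / vol for x in cross(be, ga)],
            [x / vol for x in cross(ga, al)],
            [x / vol for x in cross(al, be)])


def cyclotonic(a, p, q, al, be, ga):
    """a aa' + p cos q {bb' + gg'} + p sin q {gb' - bg'} on the axes al, be, ga."""
    al_r, be_r, ga_r = reciprocal_system(al, be, ga)
    aa, bb, gg = outer(al, al_r), outer(be, be_r), outer(ga, ga_r)
    gb, bg = outer(ga, be_r), outer(be, ga_r)
    return [[a * aa[r][c]
             + p * math.cos(q) * (bb[r][c] + gg[r][c])
             + p * math.sin(q) * (gb[r][c] - bg[r][c])
             for c in range(3)] for r in range(3)]


def close(A, B, tol=1e-9):
    return all(abs(A[r][c] - B[r][c]) < tol for r in range(3) for c in range(3))


AL, BE, GA = [1.0, 0.0, 0.0], [1.0, 2.0, 0.0], [0.0, 1.0, 3.0]
C1 = cyclotonic(2.0, 1.5, 0.4, AL, BE, GA)
C2 = cyclotonic(-3.0, 0.5, 1.1, AL, BE, GA)
PRODUCT = matmul(C1, C2)
EXPECTED = cyclotonic(2.0 * -3.0, 1.5 * 0.5, 0.4 + 1.1, AL, BE, GA)
```

Setting p to unity and a to unity gives the cyclic dyadic of No. 147, so the same code shows that the Nth power of a cyclic dyadic is obtained by multiplying q by N.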

155. In general, any dyadic may be reduced to the form either of a
tonic or of a cyclotonic. (The exceptions are such as are made by
the limiting cases.) We may show this, and also indicate how the
reduction may be made, as follows. Let $\Phi$ be any dyadic. We have
first to show that there is at least one direction of $\rho$ for which

$$\Phi\cdot\rho = a\rho.$$

This equation is equivalent to

$$\Phi\cdot\rho - a\rho = 0,$$

or,

$$\{\Phi - aI\}\cdot\rho = 0.$$

That is, $\Phi - aI$ is a planar dyadic, which may be expressed by the
equation

$$|\Phi - aI| = 0.$$

(See No. 140.) Let

$$\Phi = \lambda i + \mu j + \nu k;$$

the equation becomes

$$|[\lambda - ai]\,i + [\mu - aj]\,j + [\nu - ak]\,k| = 0,$$

or,

$$[\lambda - ai]\times[\mu - aj]\cdot[\nu - ak] = 0,$$

or,

$$a^3 - (i\cdot\lambda + j\cdot\mu + k\cdot\nu)\,a^2 + (i\cdot\mu\times\nu + j\cdot\nu\times\lambda + k\cdot\lambda\times\mu)\,a - \lambda\times\mu\cdot\nu = 0.$$

This may be written

$$a^3 - \Phi_S\,a^2 + \{\Phi^{-1}\}_S\,|\Phi|\,a - |\Phi| = 0.$$ *

Now if the dyadic $\Phi$ is given in any form, the scalars

$$\Phi_S, \qquad \{\Phi^{-1}\}_S, \qquad |\Phi|$$

are easily determined. We have therefore a cubic equation in $a$, for
which we can find at least one and perhaps three roots. That is, we
can find at least one value of $a$, and perhaps three, which will satisfy
the equation

$$|\Phi - aI| = 0.$$

By substitution of such a value, $\Phi - aI$ becomes a planar dyadic, the
planes of which may be easily determined.† Let $\alpha$ be a vector
normal to the plane of the consequents. Then

$$\{\Phi - aI\}\cdot\alpha = 0,$$

$$\Phi\cdot\alpha = a\alpha.$$

If $\Phi$ is a tonic, we may obtain three equations of this kind, say

$$\Phi\cdot\alpha = a\alpha, \qquad \Phi\cdot\beta = b\beta, \qquad \Phi\cdot\gamma = c\gamma,$$

in which $\alpha$, $\beta$, $\gamma$ are not complanar. Hence (by No. 108),

$$\Phi = a\,\alpha\alpha' + b\,\beta\beta' + c\,\gamma\gamma',$$

where $\alpha'$, $\beta'$, $\gamma'$ are the reciprocals of $\alpha$, $\beta$, $\gamma$.

In any case, we may suppose $a$ to have the same sign as $|\Phi|$, since
the cubic equation must have such a root. Let $\alpha$ (as before) be
normal to the plane of the consequents of the planar $\Phi - aI$, and $\alpha'$
normal to the plane of the antecedents, the lengths of $\alpha$ and $\alpha'$ being
such that $\alpha\cdot\alpha' = 1$.‡ Let $\beta$ be any vector normal to $\alpha'$, and such that
$\Phi\cdot\beta$ is not parallel to $\beta$. (The case in which $\Phi\cdot\beta$ is always parallel
to $\beta$, if $\beta$ is perpendicular to $\alpha'$, is evidently that of a tonic, and needs
no farther discussion.) $\{\Phi - aI\}\cdot\beta$ and therefore $\Phi\cdot\beta$ will be
perpendicular to $\alpha'$. The same will be true of $\Phi^2\cdot\beta$. Now (by No. 140)

$$[\Phi\cdot\alpha]\cdot[\Phi^2\cdot\beta]\times[\Phi\cdot\beta] = |\Phi|\;\alpha\cdot[\Phi\cdot\beta]\times\beta,$$

that is,

$$a\,\alpha\cdot[\Phi^2\cdot\beta]\times[\Phi\cdot\beta] = |\Phi|\;\alpha\cdot[\Phi\cdot\beta]\times\beta.$$

Hence, since $[\Phi^2\cdot\beta]\times[\Phi\cdot\beta]$ and $[\Phi\cdot\beta]\times\beta$ are parallel,

$$a\,[\Phi^2\cdot\beta]\times[\Phi\cdot\beta] = |\Phi|\,[\Phi\cdot\beta]\times\beta.$$

Since $a^{-1}|\Phi|$ is positive, we may set

$$p^2 = a^{-1}|\Phi|.$$

If we also set

$$\beta_1 = p^{-1}\Phi\cdot\beta, \qquad \beta_2 = p^{-2}\Phi^2\cdot\beta, \quad\text{etc.},$$

$$\beta_{-1} = p\,\Phi^{-1}\cdot\beta, \qquad \beta_{-2} = p^2\,\Phi^{-2}\cdot\beta, \quad\text{etc.},$$

the vectors $\beta$, $\beta_1$, $\beta_2$, etc., $\beta_{-1}$, $\beta_{-2}$, etc. will all lie in the plane
perpendicular to $\alpha'$, and we shall have

$$\beta_2\times\beta_1 = \beta_1\times\beta,$$

$$[\beta_2 + \beta]\times\beta_1 = 0.$$

We may therefore set

$$\beta_2 + \beta = 2n\beta_1.$$

Multiplying by $p^{-1}\Phi$, and by $p\,\Phi^{-1}$,

$$\beta_3 + \beta_1 = 2n\beta_2, \qquad \beta_4 + \beta_2 = 2n\beta_3, \quad\text{etc.},$$

$$\beta_1 + \beta_{-1} = 2n\beta, \qquad \beta + \beta_{-2} = 2n\beta_{-1}, \quad\text{etc.}$$

Now, if $n > 1$, and we lay off from a common origin the vectors

$$\beta,\ \beta_1,\ \beta_2,\ \text{etc.},\quad \beta_{-1},\ \beta_{-2},\ \text{etc.},$$

the broken line joining the termini of these vectors will be convex
toward the origin. All these vectors must therefore lie between two
limiting lines, which may be drawn from the origin, and which may
be described as having the directions of $\beta_{\infty}$ and $\beta_{-\infty}$.§ A vector
having either of these directions is unaffected in direction by
multiplication by $\Phi$. In this case, therefore, $\Phi$ is a tonic. If $n < -1$
we may obtain the same result by considering the vectors

$$\beta,\ -\beta_1,\ \beta_2,\ -\beta_3,\ \beta_4,\ \text{etc.},\quad -\beta_{-1},\ \beta_{-2},\ -\beta_{-3},\ \text{etc.},$$

except that a vector in the limiting directions will be reversed in
direction by multiplication by $\Phi$, which implies that the two
corresponding coefficients of the tonic are negative.

If $1 > n > -1$,‖ we may set

$$n = \cos q.$$

Then

$$\beta_{-1} + \beta_1 = 2\cos q\;\beta.$$

Let us now determine $\gamma$ by the equation

$$\beta_1 = \cos q\;\beta + \sin q\;\gamma.$$

This gives

$$\beta_{-1} = \cos q\;\beta - \sin q\;\gamma.$$

Now $\alpha'$ is one of the reciprocals of $\alpha$, $\beta$, and $\gamma$. Let $\beta'$ and $\gamma'$ be the
others. If we set

$$\Psi = \cos q\,\{\beta\beta' + \gamma\gamma'\} + \sin q\,\{\gamma\beta' - \beta\gamma'\},$$

we have

$$\Psi\cdot\alpha = 0, \qquad \Psi\cdot\beta = \beta_1, \qquad \Psi\cdot\beta_{-1} = \beta.$$

Therefore, since

$$\{a\,\alpha\alpha' + p\Psi\}\cdot\alpha = a\alpha = \Phi\cdot\alpha,$$

$$\{a\,\alpha\alpha' + p\Psi\}\cdot\beta = p\beta_1 = \Phi\cdot\beta,$$

$$\{a\,\alpha\alpha' + p\Psi\}\cdot\beta_{-1} = p\beta = \Phi\cdot\beta_{-1},$$

it follows (by No. 108) that

$$\Phi = a\,\alpha\alpha' + p\Psi = a\,\alpha\alpha' + p\cos q\,\{\beta\beta' + \gamma\gamma'\} + p\sin q\,\{\gamma\beta' - \beta\gamma'\}.$$

* [See note on p. 90.]
† In particular cases, $\Phi - aI$ may reduce to a linear dyadic, or to zero. These, however, will present no difficulties to the student.
‡ For the case in which the two planes are perpendicular to each other, see No. 157.
§ The termini of the vectors will in fact lie on a hyperbola.
‖ For the limiting cases, in which $n = 1$, or $n = -1$, see No. 156.
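
The cubic of No. 155 may be formed directly from the three scalars named there. The following Python sketch assumes the 3x3 coefficient representation; it builds a tonic on oblique axes with known coefficients and confirms that each coefficient is a root. The names `cubic_scalars`, `cubic`, and `tonic`, with the sample axes, are illustrative, and the middle scalar is computed through the reciprocal, so the sketch presupposes a complete dyadic.

```python
def dot(a, b):
    return sum(a[k] * b[k] for k in range(3))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def det(A):
    return dot(A[0], cross(A[1], A[2]))


def scalar_of(A):
    return A[0][0] + A[1][1] + A[2][2]


def reciprocal(A):
    d = det(A)
    cols = [cross(A[1], A[2]), cross(A[2], A[0]), cross(A[0], A[1])]
    return [[cols[c][r] / d for c in range(3)] for r in range(3)]


def prefactor(A, v):
    return [sum(A[r][k] * v[k] for k in range(3)) for r in range(3)]


def cubic_scalars(A):
    """The three scalars of the cubic: scalar of A, scalar of A^-1 times |A|, and |A|."""
    d = det(A)
    return scalar_of(A), scalar_of(reciprocal(A)) * d, d


def cubic(A, a):
    """Left member of the cubic equation in a for the dyadic A."""
    s, m, d = cubic_scalars(A)
    return a ** 3 - s * a ** 2 + m * a - d


def tonic(a, b, c, al, be, ga):
    """a aa' + b bb' + c gg' on the (possibly oblique) axes al, be, ga."""
    vol = dot(al, cross(be, ga))
    recips = [cross(be, ga), cross(ga, al), cross(al, be)]
    axes, coeffs = [al, be, ga], [a, b, c]
    return [[sum(coeffs[n] * axes[n][r] * recips[n][col] / vol for n in range(3))
             for col in range(3)] for r in range(3)]


AL, BE, GA = [1.0, 0.0, 0.0], [1.0, 2.0, 0.0], [0.0, 1.0, 3.0]
PHI = tonic(2.0, -1.0, 3.0, AL, BE, GA)
S, M, D = cubic_scalars(PHI)
```

For this tonic the three scalars come out close to 4, 1, and -6, which are the elementary symmetric functions of the coefficients 2, -1, 3. For a cyclotonic the same cubic has only one real root, the coefficient a of the text.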


156. It will be sufficient to indicate (without demonstration) the
forms of dyadics which belong to the particular cases which have
been passed over in the preceding paragraph, so far as they present
any notable peculiarities.

If $n = \pm 1$ (No. 155), the dyadic may be reduced to the form

$$a\,\alpha\alpha' + b\,\{\beta\beta' + \gamma\gamma'\} + bc\,\beta\gamma',$$

where $\alpha$, $\beta$, $\gamma$ are three non-complanar vectors, $\alpha'$, $\beta'$, $\gamma'$ their
reciprocals, and $a$, $b$, $c$ positive or negative scalars. The effect of this as an
operator will be evident if we resolve it into the three homologous
factors

$$a\,\alpha\alpha' + \beta\beta' + \gamma\gamma',$$

$$\alpha\alpha' + b\,\{\beta\beta' + \gamma\gamma'\},$$

$$\alpha\alpha' + \beta\beta' + \gamma\gamma' + c\,\beta\gamma'.$$

The displacement due to the last factor may be called a simple shear.
It consists (when the dyadic is used as prefactor) of a motion parallel
to $\beta$, and proportioned to the distance from the $\alpha$-$\beta$ plane. This
factor may be called a simple shearer.

This dyadic is homologous with such as are obtained by varying
the values of $a$, $b$, $c$, and only with such, when the values of $a$ and $b$
are different, and that of $c$ other than zero.

157. If the planar $\Phi - aI$ (No. 155) has perpendicular planes, there
may be another value of $a$, of the same sign as $|\Phi|$, which will give a
planar which has not perpendicular planes. When this is not the
case, the dyadic may always be reduced to the form

$$a\,\{\alpha\alpha' + \beta\beta' + \gamma\gamma'\} + ab\,\{\alpha\beta' + \beta\gamma'\} + ac\,\alpha\gamma',$$

where $\alpha$, $\beta$, $\gamma$ are three non-complanar vectors, $\alpha'$, $\beta'$, $\gamma'$ their
reciprocals, and $a$, $b$, $c$ positive or negative scalars. This may be resolved
into the homologous factors

$$aI \quad\text{and}\quad I + b\,\{\alpha\beta' + \beta\gamma'\} + c\,\alpha\gamma'.$$

The displacement due to the last factor may be called a complex shear.
It consists (when the dyadic is used as prefactor) of a motion parallel
to $\alpha$ which is proportional to the distance from the $\alpha$-$\gamma$ plane, together
with a motion parallel to $b\beta + c\alpha$ which is proportional to the distance
from the $\alpha$-$\beta$ plane. This factor may be called a complex shearer.

This dyadic is homologous with such as are obtained by varying
the values of $a$, $b$, $c$, and only such, unless $b = 0$.

It is always possible to take three mutually perpendicular vectors
for $\alpha$, $\beta$, and $\gamma$; or, if it be preferred, to take such values for these
vectors as shall make the term containing $c$ vanish.

158. The dyadics described in the two last paragraphs may be
called shearing dyadics.

The criterion of a shearer is

$$\{\Phi - I\}^3 = 0, \qquad \Phi - I \neq 0.$$

The criterion of a simple shearer is

$$\{\Phi - I\}^2 = 0, \qquad \Phi - I \neq 0.$$

The criterion of a complex shearer is

$$\{\Phi - I\}^3 = 0, \qquad \{\Phi - I\}^2 \neq 0.$$
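
The three criteria of No. 158 can be applied mechanically. The Python sketch below assumes the 3x3 coefficient representation and, for simplicity, takes the normal system i, j, k for the three axes, so that the reciprocals coincide with the vectors themselves. The function `classify` and the sample values of b and c are illustrative, and the tolerance used to decide that a dyadic vanishes is an assumption of the example.

```python
IDEMFACTOR = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


def matmul(A, B):
    return [[sum(A[r][k] * B[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]


def is_zero(A, tol=1e-12):
    return all(abs(A[r][c]) < tol for r in range(3) for c in range(3))


def classify(A):
    """Apply the criteria of No. 158 to the dyadic A."""
    n1 = [[A[r][c] - IDEMFACTOR[r][c] for c in range(3)] for r in range(3)]
    n2 = matmul(n1, n1)
    n3 = matmul(n2, n1)
    if is_zero(n1):
        return "idemfactor"
    if is_zero(n2):
        return "simple shearer"
    if is_zero(n3):
        return "complex shearer"
    return "not a shearer"


B, C = 0.5, 2.0   # illustrative scalars

# I + c jk: the last factor of No. 156, with beta = j and gamma = k.
SIMPLE = [[1.0, 0.0, 0.0], [0.0, 1.0, C], [0.0, 0.0, 1.0]]

# I + b {ij + jk} + c ik: the last factor of No. 157, with alpha = i.
COMPLEX = [[1.0, B, C], [0.0, 1.0, B], [0.0, 0.0, 1.0]]

# A right tensor, which satisfies none of the criteria.
STRETCH = [[2.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
```

Setting `B` to zero turns the second example into a simple shearer, which agrees with the remark in No. 157 that the case b = 0 is exceptional.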


NOTE. If a dyadic $\Phi$ is a linear function of a vector $\rho$ (the term linear being used in
the same sense as in No. 105), we may represent the relation by an equation of the form

$$\Phi = \alpha\beta\gamma\cdot\rho + \epsilon\zeta\eta\cdot\rho + \text{etc.},$$

or

$$\Phi = \{\alpha\beta\gamma + \epsilon\zeta\eta + \text{etc.}\}\cdot\rho,$$

where the expression in the braces may be called a triadic polynomial, and a single term
$\alpha\beta\gamma$ a triad, or the indeterminate product of the three vectors $\alpha$, $\beta$, $\gamma$. We are thus led
successively to the consideration of higher orders of indeterminate products of vectors,
triads, tetrads, etc., in general polyads, and of polynomials consisting of such terms,
triadics, tetradics, etc., in general polyadics. But the development of the subject in
this direction lies beyond our present purpose.

It may sometimes be convenient to use notations like

$$\left|\frac{\lambda,\ \mu,\ \nu}{\alpha,\ \beta,\ \gamma}\right. \quad\text{and}\quad \left.\frac{\lambda,\ \mu,\ \nu}{\alpha,\ \beta,\ \gamma}\right|$$

to represent the conjugate dyadics which, the first as prefactor, and the second as
postfactor, change $\alpha$, $\beta$, $\gamma$ into $\lambda$, $\mu$, $\nu$, respectively. In the notations of the preceding
chapter these would be written

$$\lambda\alpha' + \mu\beta' + \nu\gamma' \quad\text{and}\quad \alpha'\lambda + \beta'\mu + \gamma'\nu$$

respectively, $\alpha'$, $\beta'$, $\gamma'$ denoting the reciprocals of $\alpha$, $\beta$, $\gamma$. If $\tau$ is a linear function
of $\rho$, the dyadics which as prefactor and postfactor change $\rho$ into $\tau$ may be written
respectively

$$\left|\frac{\tau}{\rho}\right. \quad\text{and}\quad \left.\frac{\tau}{\rho}\right|.$$

If $\tau$ is any function of $\rho$, the dyadics which as prefactor and postfactor change $d\rho$ into
$d\tau$ may be written respectively

$$\left|\frac{d\tau}{d\rho}\right. \quad\text{and}\quad \left.\frac{d\tau}{d\rho}\right|.$$

In the notation of the following chapter the second of these (when $\rho$ denotes a
position-vector) would be written $\nabla\tau$. The triadic which as prefactor changes $d\rho$ into
$\left|\dfrac{d\tau}{d\rho}\right.$ may be written $\left|\dfrac{d^2\tau}{d\rho^2}\right.$, and that which as postfactor changes $d\rho$ into $\left.\dfrac{d\tau}{d\rho}\right|$ may be
written $\left.\dfrac{d^2\tau}{d\rho^2}\right|$. The latter would be written $\nabla\nabla\tau$ in the notations of the following chapter.


CHAPTER IV.
(SUPPLEMENTARY TO CHAPTER II.)


CONCERNING THE DIFFERENTIAL AND INTEGRAL CALCULUS OF VECTORS.


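159. If $\omega$ is a vector having continuously varying values in space
and $\rho$ the vector determining the position of a point, we may set

$$\rho = xi + yj + zk,$$
$$d\rho = dx\,i + dy\,j + dz\,k,$$

and regard $\omega$ as a function of $\rho$, or of $x$, $y$, and $z$. Then,

$$d\omega = dx\,\frac{d\omega}{dx} + dy\,\frac{d\omega}{dy} + dz\,\frac{d\omega}{dz},$$

that is,

$$d\omega = d\rho.\left\{ i\,\frac{d\omega}{dx} + j\,\frac{d\omega}{dy} + k\,\frac{d\omega}{dz} \right\}.$$

If we set

$$\nabla\omega = i\,\frac{d\omega}{dx} + j\,\frac{d\omega}{dy} + k\,\frac{d\omega}{dz},$$

$$d\omega = d\rho.\nabla\omega.$$

Here $\nabla$ stands for

$$i\,\frac{d}{dx} + j\,\frac{d}{dy} + k\,\frac{d}{dz},$$

exactly as in No. 52, except that it is here applied to a vector and
produces a dyadic, while in the former case it was applied to a
scalar and produced a vector. The dyadic $\nabla\omega$ represents the nine
differential coefficients of the three components of $\omega$ with respect to
$x$, $y$, and $z$, just as the vector $\nabla u$ (where $u$ is a scalar function of $\rho$)
represents the three differential coefficients of the scalar $u$ with
respect to $x$, $y$, and $z$.

It is evident that the expressions $\nabla.\omega$ and $\nabla\times\omega$ already defined
(No. 54) are equivalent to $\{\nabla\omega\}_S$ and $\{\nabla\omega\}_\times$.

160. An important case is that in which the vector operated on is
of the form $\nabla u$. We have then

$$d\nabla u = d\rho.\nabla\nabla u,$$

where

$$\nabla\nabla u = \frac{d^2u}{dx^2}\,ii + \frac{d^2u}{dx\,dy}\,ij + \frac{d^2u}{dx\,dz}\,ik$$
$$\qquad + \frac{d^2u}{dy\,dx}\,ji + \frac{d^2u}{dy^2}\,jj + \frac{d^2u}{dy\,dz}\,jk$$
$$\qquad + \frac{d^2u}{dz\,dx}\,ki + \frac{d^2u}{dz\,dy}\,kj + \frac{d^2u}{dz^2}\,kk.$$

This dyadic, which is evidently self-conjugate, represents the six
differential coefficients of the second order of $u$ with respect to $x$, $y$,
and $z$.*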
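
The relations of No. 159 can be checked numerically. The sketch below is only an illustration under stated assumptions: the field `omega` is a hypothetical example, the helper name `grad_dyadic` is not from the text, and the dyadic $\nabla\omega$ is stored as a 3×3 array whose first index is the direction of differentiation, estimated by central differences.

```python
def omega(p):
    """Hypothetical sample field omega(x, y, z), chosen only for illustration."""
    x, y, z = p
    return (x * y, y * z * z, x + z)


def grad_dyadic(field, p, h=1e-5):
    """Central-difference estimate of the dyadic 'nabla field' at the point p.

    G[a][b] is d(field_b)/d(x_a): row a holds the derivative of the whole
    vector along axis a, matching i d/dx + j d/dy + k d/dz used as a prefix.
    """
    G = []
    for a in range(3):
        hi, lo = list(p), list(p)
        hi[a] += h
        lo[a] -= h
        f_hi, f_lo = field(hi), field(lo)
        G.append([(f_hi[b] - f_lo[b]) / (2 * h) for b in range(3)])
    return G


p = (1.0, 2.0, 3.0)
G = grad_dyadic(omega, p)

# d(omega) = d(rho) . nabla(omega): contract d_rho with the first index.
d_rho = (1e-3, -2e-3, 5e-4)
d_omega_linear = [sum(d_rho[a] * G[a][b] for a in range(3)) for b in range(3)]
shifted = [p[a] + d_rho[a] for a in range(3)]
d_omega_exact = [u - v for u, v in zip(omega(shifted), omega(p))]

# The scalar and the vector of the dyadic give the divergence and the curl.
divergence = G[0][0] + G[1][1] + G[2][2]
curl = (G[1][2] - G[2][1], G[2][0] - G[0][2], G[0][1] - G[1][0])
```

The linear estimate and the exact increment agree up to terms of the second order in `d_rho`, which is all that the differential relation asserts.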

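161. The operators $\nabla\times$ and $\nabla.$ may be applied to dyadics in a
manner entirely analogous to their use with scalars. Thus we may
define $\nabla\times\Phi$ and $\nabla.\Phi$ by the equations

$$\nabla\times\Phi = i\times\frac{d\Phi}{dx} + j\times\frac{d\Phi}{dy} + k\times\frac{d\Phi}{dz},$$

$$\nabla.\Phi = i.\frac{d\Phi}{dx} + j.\frac{d\Phi}{dy} + k.\frac{d\Phi}{dz}.$$

Then, if $\Phi = \alpha i + \beta j + \gamma k$,

$$\nabla\times\Phi = \nabla\times\alpha\, i + \nabla\times\beta\, j + \nabla\times\gamma\, k,$$
$$\nabla.\Phi = \nabla.\alpha\, i + \nabla.\beta\, j + \nabla.\gamma\, k.$$

Or, if $\Phi = i\alpha + j\beta + k\gamma$,

$$\nabla\times\Phi = i\left\{\frac{d\gamma}{dy} - \frac{d\beta}{dz}\right\} + j\left\{\frac{d\alpha}{dz} - \frac{d\gamma}{dx}\right\} + k\left\{\frac{d\beta}{dx} - \frac{d\alpha}{dy}\right\},$$

$$\nabla.\Phi = \frac{d\alpha}{dx} + \frac{d\beta}{dy} + \frac{d\gamma}{dz}.$$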

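162. We may now regard $\nabla.\nabla$ in expressions like $\nabla.\nabla\omega$ as
representing two successive operations, the result of which will be

$$\frac{d^2\omega}{dx^2} + \frac{d^2\omega}{dy^2} + \frac{d^2\omega}{dz^2},$$

in accordance with the definition of No. 70. We may also write
$\nabla.\nabla\Phi$ for

$$\frac{d^2\Phi}{dx^2} + \frac{d^2\Phi}{dy^2} + \frac{d^2\Phi}{dz^2},$$

although in this case we cannot regard $\nabla.\nabla$ as representing two
successive operations until we have defined $\nabla\Phi$.†

That $\nabla.\nabla\Phi = \nabla\nabla.\Phi - \nabla\times\nabla\times\Phi$ will be evident if we suppose $\Phi$ to
be expressed in the form $\alpha i + \beta j + \gamma k$. (See No. 71.)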

163. We have already seen that

$$u'' - u' = \int d\rho.\nabla u,$$

where $u'$ and $u''$ denote the values of $u$ at the beginning and the end
of the line to which the integral relates. The same relation will hold
for a vector; i.e.,

$$\omega'' - \omega' = \int d\rho.\nabla\omega.$$


*We might proceed to higher steps in differentiation by means of the triadics $\nabla\nabla\omega$,
$\nabla\nabla\nabla u$, the tetradics $\nabla\nabla\nabla\omega$, $\nabla\nabla\nabla\nabla u$, etc. See note on page 74. In like manner a dyadic
function of position in space ($\Phi$) might be differentiated by means of the triadic $\nabla\Phi$, the
tetradic $\nabla\nabla\Phi$, etc.

†See footnote to No. 160.


164. The following equations between surface-integrals for a closed
surface and volume-integrals for the space enclosed seem worthy of
mention. One or two have already been given, and are here repeated
for the sake of comparison.

$$\iint d\sigma\, u = \iiint dv\, \nabla u, \qquad (1)$$
$$\iint d\sigma\, \omega = \iiint dv\, \nabla\omega, \qquad (2)$$
$$\iint d\sigma.\omega = \iiint dv\, \nabla.\omega, \qquad (3)$$
$$\iint d\sigma.\Phi = \iiint dv\, \nabla.\Phi, \qquad (4)$$
$$\iint d\sigma\times\omega = \iiint dv\, \nabla\times\omega, \qquad (5)$$
$$\iint d\sigma\times\Phi = \iiint dv\, \nabla\times\Phi. \qquad (6)$$

It may aid the memory of the student to observe that the
transformation may be effected in each case by substituting $\iiint dv\,\nabla$ for $\iint d\sigma$.

165. The following equations between line-integrals for a closed
line and surface-integrals for any surface bounded by the line, may
also be mentioned. (One of these has already been given. See
No. 60.)

$$\int d\rho\, u = \iint d\sigma\times\nabla u, \qquad (1)$$
$$\int d\rho\, \omega = \iint d\sigma\times\nabla\omega, \qquad (2)$$
$$\int d\rho.\omega = \iint d\sigma.\nabla\times\omega, \qquad (3)$$
$$\int d\rho.\Phi = \iint d\sigma.\nabla\times\Phi, \qquad (4)$$
$$\int d\rho\times\omega = \iint \nabla\omega.d\sigma - \iint d\sigma\, \nabla.\omega. \qquad (5)$$

These transformations may be effected by substituting $\iint [d\sigma\times\nabla]$ for
$\int d\rho$. The brackets are here introduced to indicate that the
multiplication of $d\sigma$ with the $i$, $j$, $k$ implied in $\nabla$ is to be performed before
any other multiplication which may be required by a subsequent sign.
(This notation is not recommended for ordinary use, but only
suggested as a mnemonic artifice.)

166. To the equations in No. 65 may be added many others, as,

$$\nabla[u\omega] = \nabla u\, \omega + u\nabla\omega, \qquad (1)$$
$$\nabla[\tau\times\omega] = \nabla\tau\times\omega - \nabla\omega\times\tau, \qquad (2)$$
$$\nabla\times[\tau\times\omega] = \omega.\nabla\tau - \nabla.\tau\, \omega - \tau.\nabla\omega + \nabla.\omega\, \tau, \qquad (3)$$
$$\nabla(\tau.\omega) = \nabla\tau.\omega + \nabla\omega.\tau, \qquad (4)$$
$$\nabla.\{\tau\omega\} = \nabla.\tau\, \omega + \tau.\nabla\omega, \qquad (5)$$
$$\nabla\times\{\tau\omega\} = \nabla\times\tau\, \omega - \tau\times\nabla\omega, \qquad (6)$$
$$\nabla.\{u\Phi\} = \nabla u.\Phi + u\nabla.\Phi, \qquad (7)$$
etc.

The principle in all these cases is that if we have one of the operators
$\nabla$, $\nabla.$, $\nabla\times$ prefixed to a product of any kind, and we make any
transformation of the expression which would be allowable if the $\nabla$
were a vector (viz., by changes in the order of the factors, in the
signs of multiplication, in the parentheses written or implied, etc.), by
which changes the $\nabla$ is brought into connection with one particular
factor, the expression thus transformed will represent the part of
the value of the original expression which results from the variation
of that factor.
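
As a hedged numerical illustration of equation (3) of No. 166, the sketch below compares the two members at one point by central differences. The fields `tau` and `omega` are hypothetical examples, the helper names are assumptions of this sketch, and the dyadic $\nabla\tau$ is stored with the direction of differentiation as its first index, so that $\omega.\nabla\tau$ contracts $\omega$ with that index.

```python
def tau(p):
    """Hypothetical sample field, for illustration only."""
    x, y, z = p
    return (y * z, x * x, x + y)


def omega(p):
    """Second hypothetical sample field."""
    x, y, z = p
    return (x * y, z, y * y)


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def grad_dyadic(field, p, h=1e-5):
    """G[a][b] = d(field_b)/d(x_a), estimated by central differences."""
    G = []
    for a in range(3):
        hi, lo = list(p), list(p)
        hi[a] += h
        lo[a] -= h
        f_hi, f_lo = field(hi), field(lo)
        G.append([(f_hi[b] - f_lo[b]) / (2 * h) for b in range(3)])
    return G


def div(G):
    return G[0][0] + G[1][1] + G[2][2]


def curl(G):
    return (G[1][2] - G[2][1], G[2][0] - G[0][2], G[0][1] - G[1][0])


p = (0.5, -1.0, 2.0)
G_tau, G_omega = grad_dyadic(tau, p), grad_dyadic(omega, p)
t, w = tau(p), omega(p)

# First member: nabla x [tau x omega], differentiating the product directly.
lhs = curl(grad_dyadic(lambda q: cross(tau(q), omega(q)), p))

# Second member: omega.(nabla tau) - (nabla.tau) omega
#                - tau.(nabla omega) + (nabla.omega) tau
w_dot_grad_tau = [sum(w[a] * G_tau[a][b] for a in range(3)) for b in range(3)]
t_dot_grad_omega = [sum(t[a] * G_omega[a][b] for a in range(3)) for b in range(3)]
rhs = [w_dot_grad_tau[b] - div(G_tau) * w[b]
       - t_dot_grad_omega[b] + div(G_omega) * t[b] for b in range(3)]

residual = max(abs(l - r) for l, r in zip(lhs, rhs))
```

Each of the four terms of the second member is the part of the result in which $\nabla$ acts on a single factor, which is the principle stated above.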

167. From the relations indicated in the last four paragraphs, may 
be obtained directly a great number of transformations of definite 
integrals similar to those given in Nos. 74-77, and corresponding to 
those known in the scalar calculus by the name of integration by 
parts.

168. The student will now find no difficulty in generalizing the 
integrations of differential equations given in Nos. 78-89 by applying 
to vectors those which relate to scalars, and to dyadics those which 
relate to vectors. 

169. The propositions in No. 90 relating to minimum values of the
volume-integral $\iiint u\,\omega.\omega\,dv$ may be generalized by substituting
$\omega.\Phi.\omega$ for $u\,\omega.\omega$, $\Phi$ being a given dyadic function of position in space.

170. The theory of the integrals which have been called potentials, 
Newtonians, etc. (see Nos. 91-102) may be extended to cases in which 
the operand is a vector instead of a scalar or a dyadic instead of a 
vector. So far as the demonstrations are concerned, the case of a 
vector may be reduced to that of a scalar by considering separately 
its three components, and the case of a dyadic may be reduced to that 
of a vector, by supposing the dyadic expressed in the form $\phi i + \chi j + \omega k$
and considering each of these terms separately. 


CHAPTER V. 


CONCERNING TRANSCENDENTAL FUNCTIONS OF DYADICS. 


171. Def—The exponential function, the sine and the cosine of a
dyadic may be defined by infinite series, exactly as the corresponding
functions in scalar analysis, viz.,

$$e^{\Phi} = I + \Phi + \frac{1}{2}\Phi^2 + \frac{1}{2\cdot 3}\Phi^3 + \text{etc.},$$
$$\sin\Phi = \Phi - \frac{1}{2\cdot 3}\Phi^3 + \frac{1}{2\cdot 3\cdot 4\cdot 5}\Phi^5 - \text{etc.},$$
$$\cos\Phi = I - \frac{1}{2}\Phi^2 + \frac{1}{2\cdot 3\cdot 4}\Phi^4 - \text{etc.}$$

These series are always convergent. For every value of $\Phi$ there is
one and only one value of each of these functions. The exponential
function may also be defined as the limit of the expression

$$\left(I + \frac{\Phi}{N}\right)^{N}$$

when $N$, which is a whole number, is increased indefinitely. That
this definition is equivalent to the preceding, will appear if the
expression is expanded by the binomial theorem, which is evidently
applicable in a case of this kind.

These functions of $\Phi$ are homologous with $\Phi$.
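
The equivalence of the two definitions of the exponential can be illustrated numerically. The following is a minimal sketch, assuming that a dyadic is represented by its 3×3 matrix of coefficients; the sample values in `PHI` and all helper names are hypothetical.

```python
IDENTITY = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]


def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def mat_add(A, B):
    return [[A[i][j] + B[i][j] for j in range(3)] for i in range(3)]


def mat_scale(c, A):
    return [[c * A[i][j] for j in range(3)] for i in range(3)]


def exp_series(A, terms=30):
    """Partial sum of I + A + A^2/2 + A^3/(2.3) + etc."""
    result, term = IDENTITY, IDENTITY
    for n in range(1, terms):
        term = mat_scale(1.0 / n, mat_mul(term, A))  # term = A^n / n!
        result = mat_add(result, term)
    return result


def exp_limit(A, squarings=20):
    """(I + A/N)^N with N = 2**squarings, formed by repeated squaring."""
    n_big = 2 ** squarings
    M = mat_add(IDENTITY, mat_scale(1.0 / n_big, A))
    for _ in range(squarings):
        M = mat_mul(M, M)
    return M


def max_diff(A, B):
    return max(abs(A[i][j] - B[i][j]) for i in range(3) for j in range(3))


# Hypothetical sample dyadic.
PHI = [[0.1, 0.5, 0.0],
       [-0.3, 0.2, 0.4],
       [0.0, 0.1, -0.2]]

series = exp_series(PHI)
limit = exp_limit(PHI)
gap = max_diff(series, limit)

# "Homologous" is checked here only in the sense that the products commute.
commutator = max_diff(mat_mul(series, PHI), mat_mul(PHI, series))
```

The difference `gap` shrinks roughly in proportion to 1/N, so a larger number of squarings brings the limit form closer to the series.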
172. We may define the logarithm as the function which is the
inverse of the exponential, so that the equations

$$e^{\Psi} = \Phi,$$
$$\Psi = \log\Phi,$$

are equivalent, leaving it undetermined for the present whether
every dyadic has a logarithm, and whether a dyadic can have more
than one.
173. It follows at once from the second definition of the exponential
function that, if $\Phi$ and $\Psi$ are homologous,

$$e^{\Phi}.e^{\Psi} = e^{\Phi+\Psi},$$

and that, if $T$ is a positive or negative whole number,

$$\{e^{\Phi}\}^{T} = e^{T\Phi}.$$
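174. If $\Xi$ and $\Phi$ are homologous dyadics, and such that

$$\Xi.\Xi = -I,$$

the definitions of No. 171 give immediately

$$e^{\Xi.\Phi} = \cos\Phi + \Xi.\sin\Phi,$$
$$e^{-\Xi.\Phi} = \cos\Phi - \Xi.\sin\Phi,$$

whence

$$\cos\Phi = \tfrac{1}{2}\{e^{\Xi.\Phi} + e^{-\Xi.\Phi}\},$$
$$\sin\Phi = -\tfrac{1}{2}\,\Xi.\{e^{\Xi.\Phi} - e^{-\Xi.\Phi}\}.$$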
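175. If $\Phi.\Psi = \Psi.\Phi = 0$,

$$\{\Phi+\Psi\}^2 = \Phi^2 + \Psi^2, \qquad \{\Phi+\Psi\}^3 = \Phi^3 + \Psi^3, \qquad \text{etc.}$$

Therefore

$$e^{\Phi+\Psi} = e^{\Phi} + e^{\Psi} - I,$$
$$\cos\{\Phi+\Psi\} = \cos\Phi + \cos\Psi - I,$$
$$\sin\{\Phi+\Psi\} = \sin\Phi + \sin\Psi.$$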
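176.

$$|e^{\Phi}| = e^{\Phi_S}.$$

For the first member of this equation is the limit of

$$|\{I + N^{-1}\Phi\}^{N}|, \quad\text{that is, of}\quad |I + N^{-1}\Phi|^{N}.$$

If we set $\Phi = \alpha i + \beta j + \gamma k$, the limit becomes that of

$$(1 + N^{-1}\alpha.i + N^{-1}\beta.j + N^{-1}\gamma.k)^{N}, \quad\text{or}\quad (1 + N^{-1}\Phi_S)^{N},$$

the limit of which is the second member of the equation to be proved.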
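
A short numerical check of the proposition of No. 176 may be sketched as follows, assuming the dyadic is represented by a 3×3 matrix, its determinant $|\Phi|$ by the matrix determinant, and its scalar $\Phi_S$ by the trace. The sample matrix and the helper names are hypothetical.

```python
import math


def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def exp_series(A, terms=30):
    """Partial sum of the exponential series of No. 171."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        prod = mat_mul(term, A)
        term = [[prod[i][j] / n for j in range(3)] for i in range(3)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result


def det3(M):
    return (M[0][0] * (M[1][1] * M[2][2] - M[1][2] * M[2][1])
            - M[0][1] * (M[1][0] * M[2][2] - M[1][2] * M[2][0])
            + M[0][2] * (M[1][0] * M[2][1] - M[1][1] * M[2][0]))


# Hypothetical sample dyadic.
PHI = [[0.1, 0.5, 0.0],
       [-0.3, 0.2, 0.4],
       [0.0, 0.1, -0.2]]

scalar_of_phi = PHI[0][0] + PHI[1][1] + PHI[2][2]   # Phi_S
det_of_exp = det3(exp_series(PHI))                  # |e^Phi|
exp_of_scalar = math.exp(scalar_of_phi)             # e^(Phi_S)
```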
177. By the definition of exponentials, the expression

$$e^{q\{kj - jk\}}$$

represents the limit of

$$\{I + qN^{-1}(kj - jk)\}^{N}.$$

Now $I + qN^{-1}\{kj - jk\}$ evidently represents a versor having the axis $i$
and the infinitesimal angle of version $qN^{-1}$. Hence the above
exponential represents a versor having the same axis and the angle of
version $q$. If we set $qi = \omega$, the exponential may be written

$$e^{I\times\omega}.$$

Such an expression therefore represents a versor. The axis and
direction of rotation are determined by the direction of $\omega$, and the
angle of rotation is equal to the magnitude of $\omega$. The value of the
versor will not be affected by increasing or diminishing the magnitude
of $\omega$ by $2\pi$.
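
The statements of No. 177 can be tried numerically. In the sketch below (an illustration only) the dyadic $I\times\omega$ is represented by the matrix which turns any vector $v$ into $\omega\times v$; the sample vector `w` and the helper names are assumptions of the sketch.

```python
import math


def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def exp_series(A, terms=80):
    """Partial sum of the exponential series of No. 171."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        prod = mat_mul(term, A)
        term = [[prod[i][j] / n for j in range(3)] for i in range(3)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result


def skew(w):
    """Matrix of I x w: applied as a prefactor to v it gives w x v."""
    x, y, z = w
    return [[0.0, -z, y],
            [z, 0.0, -x],
            [-y, x, 0.0]]


w = (0.3, -0.4, 1.2)                           # hypothetical omega
angle = math.sqrt(sum(c * c for c in w))       # magnitude of omega (1.3)
R = exp_series(skew(w))

# A versor leaves its axis unchanged and is orthogonal.
R_w = [sum(R[i][k] * w[k] for k in range(3)) for i in range(3)]
R_t = [[R[j][i] for j in range(3)] for i in range(3)]
RRt = mat_mul(R, R_t)
trace = R[0][0] + R[1][1] + R[2][2]            # 1 + 2 cos(angle) for a rotation

# Increasing the magnitude of omega by 2*pi should give the same versor.
factor = (angle + 2 * math.pi) / angle
R_shifted = exp_series(skew([factor * c for c in w]))
shift_gap = max(abs(R[i][j] - R_shifted[i][j])
                for i in range(3) for j in range(3))
```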

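178. If, as in No. 151,

$$\Phi = a\,\alpha\alpha' + b\,\beta\beta' + c\,\gamma\gamma',$$

the definitions of No. 171 give

$$e^{\Phi} = e^{a}\,\alpha\alpha' + e^{b}\,\beta\beta' + e^{c}\,\gamma\gamma',$$
$$\cos\Phi = \cos a\,\alpha\alpha' + \cos b\,\beta\beta' + \cos c\,\gamma\gamma',$$
$$\sin\Phi = \sin a\,\alpha\alpha' + \sin b\,\beta\beta' + \sin c\,\gamma\gamma'.$$

If $a$, $b$, $c$ are positive and unequal, we may add, by No. 172,

$$\log\Phi = \log a\,\alpha\alpha' + \log b\,\beta\beta' + \log c\,\gamma\gamma'.$$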
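179. If, as in No. 158,

$$\Phi = a\,\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\} + c\{\gamma\beta' - \beta\gamma'\}$$
$$\quad = a\,\alpha\alpha' + p\cos q\,\{\beta\beta' + \gamma\gamma'\} + p\sin q\,\{\gamma\beta' - \beta\gamma'\},$$

we have by No. 173

$$e^{\Phi} = e^{a\alpha\alpha'}.e^{b\{\beta\beta'+\gamma\gamma'\}}.e^{c\{\gamma\beta'-\beta\gamma'\}}.$$

But

$$e^{a\alpha\alpha'} = e^{a}\,\alpha\alpha' + \beta\beta' + \gamma\gamma',$$
$$e^{b\{\beta\beta'+\gamma\gamma'\}} = \alpha\alpha' + e^{b}\{\beta\beta' + \gamma\gamma'\},$$
$$e^{c\{\gamma\beta'-\beta\gamma'\}} = \alpha\alpha' + \cos c\,\{\beta\beta' + \gamma\gamma'\} + \sin c\,\{\gamma\beta' - \beta\gamma'\}.$$

Therefore,

$$e^{\Phi} = e^{a}\,\alpha\alpha' + e^{b}\cos c\,\{\beta\beta' + \gamma\gamma'\} + e^{b}\sin c\,\{\gamma\beta' - \beta\gamma'\}.$$

Hence, if $a$ is positive,

$$\log\Phi = \log a\,\alpha\alpha' + \log p\,\{\beta\beta' + \gamma\gamma'\} + q\{\gamma\beta' - \beta\gamma'\}.$$

Since the value of $\Phi$ is not affected by increasing or diminishing $q$ by
$2\pi$, the function $\log\Phi$ is many-valued.

To find the value of $\cos\Phi$ and $\sin\Phi$, let us set

$$\Theta = b\{\beta\beta' + \gamma\gamma'\} + c\{\gamma\beta' - \beta\gamma'\},$$
$$\Xi = \gamma\beta' - \beta\gamma'.$$

Then, by No. 175,

$$\cos\Phi = \cos\{a\,\alpha\alpha'\} + \cos\Theta - I.$$

But

$$\cos\{a\,\alpha\alpha'\} - I = \cos a\,\alpha\alpha' - \alpha\alpha'.$$

Therefore

$$\cos\Phi = \cos a\,\alpha\alpha' - \alpha\alpha' + \cos\Theta.$$

Now, by No. 174,

$$\cos\Theta = \tfrac{1}{2}\{e^{\Xi.\Theta} + e^{-\Xi.\Theta}\},$$
$$\Xi.\Theta = -c\{\beta\beta' + \gamma\gamma'\} + b\{\gamma\beta' - \beta\gamma'\},$$
$$e^{\Xi.\Theta} = \alpha\alpha' + e^{-c}\cos b\,\{\beta\beta' + \gamma\gamma'\} + e^{-c}\sin b\,\{\gamma\beta' - \beta\gamma'\},$$
$$e^{-\Xi.\Theta} = \alpha\alpha' + e^{c}\cos b\,\{\beta\beta' + \gamma\gamma'\} - e^{c}\sin b\,\{\gamma\beta' - \beta\gamma'\}.$$

Therefore

$$\cos\Theta = \alpha\alpha' + \tfrac{1}{2}(e^{c} + e^{-c})\cos b\,\{\beta\beta' + \gamma\gamma'\} - \tfrac{1}{2}(e^{c} - e^{-c})\sin b\,\{\gamma\beta' - \beta\gamma'\},$$

and

$$\cos\Phi = \cos a\,\alpha\alpha' + \tfrac{1}{2}(e^{c} + e^{-c})\cos b\,\{\beta\beta' + \gamma\gamma'\} - \tfrac{1}{2}(e^{c} - e^{-c})\sin b\,\{\gamma\beta' - \beta\gamma'\}.$$

In like manner we find

$$\sin\Phi = \sin a\,\alpha\alpha' + \tfrac{1}{2}(e^{c} + e^{-c})\sin b\,\{\beta\beta' + \gamma\gamma'\} + \tfrac{1}{2}(e^{c} - e^{-c})\cos b\,\{\gamma\beta' - \beta\gamma'\}.$$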
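180. If $\alpha$, $\beta$, $\gamma$ and $\alpha'$, $\beta'$, $\gamma'$ are reciprocals, and

$$\Phi = a\,\alpha\alpha' + b\{\beta\beta' + \gamma\gamma'\} + c\,\beta\gamma',$$

and $N$ is any whole number,

$$\Phi^{N} = a^{N}\alpha\alpha' + b^{N}\{\beta\beta' + \gamma\gamma'\} + N b^{N-1} c\,\beta\gamma'.$$

Therefore,

$$e^{\Phi} = e^{a}\,\alpha\alpha' + e^{b}\{\beta\beta' + \gamma\gamma'\} + e^{b} c\,\beta\gamma',$$
$$\cos\Phi = \cos a\,\alpha\alpha' + \cos b\,\{\beta\beta' + \gamma\gamma'\} - c\sin b\,\beta\gamma',$$
$$\sin\Phi = \sin a\,\alpha\alpha' + \sin b\,\{\beta\beta' + \gamma\gamma'\} + c\cos b\,\beta\gamma'.$$

If $a$ and $b$ are unequal, and $c$ other than zero, we may add

$$\log\Phi = \log a\,\alpha\alpha' + \log b\,\{\beta\beta' + \gamma\gamma'\} + c\,b^{-1}\beta\gamma'.$$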
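181. If $\alpha$, $\beta$, $\gamma$, and $\alpha'$, $\beta'$, $\gamma'$ are reciprocals, and

$$\Phi = aI + b\{\alpha\beta' + \beta\gamma'\} + c\,\alpha\gamma',$$

and $N$ is a whole number,

$$\Phi^{N} = a^{N}I + N a^{N-1} b\{\alpha\beta' + \beta\gamma'\} + \left(N a^{N-1} c + \tfrac{1}{2}N(N-1)\,a^{N-2} b^{2}\right)\alpha\gamma'.$$

Therefore

$$e^{\Phi} = e^{a}I + e^{a} b\{\alpha\beta' + \beta\gamma'\} + e^{a}\left(\tfrac{1}{2}b^{2} + c\right)\alpha\gamma',$$
$$\cos\Phi = \cos a\,I - b\sin a\,\{\alpha\beta' + \beta\gamma'\} - \left(\tfrac{1}{2}b^{2}\cos a + c\sin a\right)\alpha\gamma',$$
$$\sin\Phi = \sin a\,I + b\cos a\,\{\alpha\beta' + \beta\gamma'\} - \left(\tfrac{1}{2}b^{2}\sin a - c\cos a\right)\alpha\gamma'.$$

Unless $b = 0$, we may add

$$\log\Phi = \log a\,I + b\,a^{-1}\{\alpha\beta' + \beta\gamma'\} + \left(c\,a^{-1} - \tfrac{1}{2}b^{2}a^{-2}\right)\alpha\gamma'.$$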

182. If we suppose any dyadic $\Phi$ to vary, but with the limitation
that all its values are homologous, we may obtain from the definitions
of No. 171

$$d\{e^{\Phi}\} = e^{\Phi}.d\Phi, \qquad (1)$$
$$d\sin\Phi = \cos\Phi.d\Phi, \qquad (2)$$
$$d\cos\Phi = -\sin\Phi.d\Phi, \qquad (3)$$
$$d\log\Phi = \Phi^{-1}.d\Phi, \qquad (4)$$

as in the ordinary calculus, but we must not apply these equations to
cases in which the values of $\Phi$ are not homologous.

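183. If, however, $\Gamma$ is any constant dyadic, the variations of $t\Gamma$ will
necessarily be homologous with $t\Gamma$, and we may write without other
limitation than that $\Gamma$ is constant,

$$\frac{d\{e^{t\Gamma}\}}{dt} = \Gamma.e^{t\Gamma}, \qquad (1)$$
$$\frac{d\sin\{t\Gamma\}}{dt} = \Gamma.\cos\{t\Gamma\}, \qquad (2)$$
$$\frac{d\cos\{t\Gamma\}}{dt} = -\Gamma.\sin\{t\Gamma\}, \qquad (3)$$
$$\frac{d\log\{t\Gamma\}}{dt} = \frac{I}{t}. \qquad (4)$$

A second differentiation gives

$$\frac{d^{2}\{e^{t\Gamma}\}}{dt^{2}} = \Gamma^{2}.e^{t\Gamma}, \qquad (5)$$
$$\frac{d^{2}\sin\{t\Gamma\}}{dt^{2}} = -\Gamma^{2}.\sin\{t\Gamma\}, \qquad (6)$$
$$\frac{d^{2}\cos\{t\Gamma\}}{dt^{2}} = -\Gamma^{2}.\cos\{t\Gamma\}. \qquad (7)$$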
184. It follows that if we have a differential equation of the form

$$\frac{d\rho}{dt} = \Gamma.\rho,$$

the integral equation will be of the form

$$\rho = e^{t\Gamma}.\rho',$$

$\rho'$ representing the value of $\rho$ for $t = 0$. For this gives

$$\frac{d\rho}{dt} = \Gamma.e^{t\Gamma}.\rho' = \Gamma.\rho,$$

and the proper value of $\rho$ for $t = 0$.
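
The integral of No. 184 can be compared with a direct step-by-step integration of the differential equation. The sketch below is an illustration under assumptions: the dyadic $\Gamma$ is represented by a hypothetical 3×3 matrix `GAMMA`, and the classical fourth-order Runge-Kutta scheme is used merely as an independent check, not as anything prescribed by the text.

```python
def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def mat_vec(A, v):
    return [sum(A[i][k] * v[k] for k in range(3)) for i in range(3)]


def exp_series(A, terms=40):
    """Partial sum of the exponential series of No. 171."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]
    for n in range(1, terms):
        prod = mat_mul(term, A)
        term = [[prod[i][j] / n for j in range(3)] for i in range(3)]
        result = [[result[i][j] + term[i][j] for j in range(3)]
                  for i in range(3)]
    return result


def rk4(G, rho, t_end, steps):
    """Integrate d(rho)/dt = G.rho from 0 to t_end by classical RK4."""
    h = t_end / steps
    for _ in range(steps):
        k1 = mat_vec(G, rho)
        k2 = mat_vec(G, [r + 0.5 * h * k for r, k in zip(rho, k1)])
        k3 = mat_vec(G, [r + 0.5 * h * k for r, k in zip(rho, k2)])
        k4 = mat_vec(G, [r + h * k for r, k in zip(rho, k3)])
        rho = [r + h * (a + 2 * b + 2 * c + d) / 6
               for r, a, b, c, d in zip(rho, k1, k2, k3, k4)]
    return rho


# Hypothetical constant dyadic and initial position.
GAMMA = [[0.0, 1.0, 0.0],
         [-2.0, -0.3, 0.5],
         [0.1, 0.0, -0.4]]
rho_prime = [1.0, 0.0, 2.0]
t = 1.5

t_gamma = [[t * GAMMA[i][j] for j in range(3)] for i in range(3)]
rho_closed = mat_vec(exp_series(t_gamma), rho_prime)   # e^(t Gamma).rho'
rho_stepped = rk4(GAMMA, rho_prime, t, 1000)
mismatch = max(abs(a - b) for a, b in zip(rho_closed, rho_stepped))
```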

185. Def—A flux which is a linear function of the position-vector
is called a homogeneous-strain-flux from the nature of the strain
which it produces. Such a flux may evidently be represented by a
dyadic.

In the equations of the last paragraph, we may suppose $\rho$ to
represent a position-vector, $t$ the time, and $\Gamma$ a homogeneous-strain-flux.
Then $e^{t\Gamma}$ will represent the strain produced by the flux $\Gamma$ in
the time $t$.

In like manner, if $\Delta$ represents a homogeneous strain, $\{\log\Delta\}/t$
will represent a homogeneous-strain-flux which would produce the
strain $\Delta$ in the time $t$.
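186. If we have

$$\frac{d^{2}\rho}{dt^{2}} = \Gamma^{2}.\rho,$$

where $\Gamma$ is complete, the integral equation will be of the form

$$\rho = e^{t\Gamma}.\alpha + e^{-t\Gamma}.\beta.$$

For this gives

$$\frac{d\rho}{dt} = \Gamma.e^{t\Gamma}.\alpha - \Gamma.e^{-t\Gamma}.\beta,$$
$$\frac{d^{2}\rho}{dt^{2}} = \Gamma^{2}.e^{t\Gamma}.\alpha + \Gamma^{2}.e^{-t\Gamma}.\beta = \Gamma^{2}.\rho,$$

and $\alpha$ and $\beta$ may be determined so as to satisfy the equations

$$\rho_{t=0} = \alpha + \beta,$$
$$\left[\frac{d\rho}{dt}\right]_{t=0} = \Gamma.(\alpha - \beta).$$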
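187. The differential equation

$$\frac{d^{2}\rho}{dt^{2}} = -\Gamma^{2}.\rho$$

will be satisfied by

$$\rho = \cos\{t\Gamma\}.\alpha + \sin\{t\Gamma\}.\beta,$$

whence

$$\frac{d\rho}{dt} = -\Gamma.\sin\{t\Gamma\}.\alpha + \Gamma.\cos\{t\Gamma\}.\beta,$$
$$\frac{d^{2}\rho}{dt^{2}} = -\Gamma^{2}.\cos\{t\Gamma\}.\alpha - \Gamma^{2}.\sin\{t\Gamma\}.\beta = -\Gamma^{2}.\rho.$$

If $\Gamma$ is complete, the constants $\alpha$ and $\beta$ may be determined to satisfy
the equations

$$\rho_{t=0} = \alpha,$$
$$\left[\frac{d\rho}{dt}\right]_{t=0} = \Gamma.\beta.$$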
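
The solution of No. 187 may be tested numerically. The following sketch assumes a hypothetical matrix `GAMMA` for the dyadic $\Gamma$ and hypothetical constants `ALPHA`, `BETA`; the sine and cosine are formed from the series of No. 171, and the derivatives are estimated by finite differences.

```python
IDENTITY = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]


def mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def scale(c, A):
    return [[c * A[i][j] for j in range(3)] for i in range(3)]


def mat_vec(A, v):
    return [sum(A[i][k] * v[k] for k in range(3)) for i in range(3)]


def cos_sin(A, terms=40):
    """Cosine and sine of a matrix by the series of No. 171."""
    cos_m = [row[:] for row in IDENTITY]
    sin_m = [[0.0] * 3 for _ in range(3)]
    term = [row[:] for row in IDENTITY]
    for n in range(1, terms):
        term = scale(1.0 / n, mul(term, A))          # term = A^n / n!
        sign = -1.0 if (n // 2) % 2 else 1.0         # signs +, -, -, +, +, ...
        target = sin_m if n % 2 else cos_m           # odd powers go to the sine
        for i in range(3):
            for j in range(3):
                target[i][j] += sign * term[i][j]
    return cos_m, sin_m


# Hypothetical data.
GAMMA = [[1.0, 0.2, 0.0],
         [0.0, 0.5, 0.3],
         [0.1, 0.0, 0.8]]
ALPHA = [1.0, -1.0, 0.5]
BETA = [0.0, 2.0, 1.0]


def rho(t):
    """rho = cos{t Gamma}.alpha + sin{t Gamma}.beta"""
    C, S = cos_sin(scale(t, GAMMA))
    return [c + s for c, s in zip(mat_vec(C, ALPHA), mat_vec(S, BETA))]


h = 1e-3
t0 = 0.7

# Second derivative by a central second difference, against -Gamma^2.rho.
accel = [(a - 2 * b + c) / (h * h)
         for a, b, c in zip(rho(t0 + h), rho(t0), rho(t0 - h))]
expected = [-v for v in mat_vec(mul(GAMMA, GAMMA), rho(t0))]
ode_residual = max(abs(a - e) for a, e in zip(accel, expected))

# Initial conditions: rho(0) = alpha and d(rho)/dt at t = 0 equals Gamma.beta.
start_gap = max(abs(a - b) for a, b in zip(rho(0.0), ALPHA))
velocity0 = [(a - b) / (2 * h) for a, b in zip(rho(h), rho(-h))]
velocity_gap = max(abs(a - b) for a, b in zip(velocity0, mat_vec(GAMMA, BETA)))
```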

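188. If

$$\frac{d^{2}\rho}{dt^{2}} = \{\Gamma^{2} - \Delta^{2}\}.\rho,$$

where $\Gamma^{2} - \Delta^{2}$ is a complete dyadic, and

$$\Gamma.\Delta = \Delta.\Gamma = 0,$$

we may set

$$\rho = \{\tfrac{1}{2}e^{t\Gamma} + \tfrac{1}{2}e^{-t\Gamma} + \cos\{t\Delta\} - I\}.\alpha + \{\tfrac{1}{2}e^{t\Gamma} - \tfrac{1}{2}e^{-t\Gamma} + \sin\{t\Delta\}\}.\beta,$$

which gives

$$\frac{d\rho}{dt} = \{\tfrac{1}{2}\Gamma.e^{t\Gamma} - \tfrac{1}{2}\Gamma.e^{-t\Gamma} - \Delta.\sin\{t\Delta\}\}.\alpha$$
$$\qquad + \{\tfrac{1}{2}\Gamma.e^{t\Gamma} + \tfrac{1}{2}\Gamma.e^{-t\Gamma} + \Delta.\cos\{t\Delta\}\}.\beta,$$

$$\frac{d^{2}\rho}{dt^{2}} = \{\tfrac{1}{2}\Gamma^{2}.e^{t\Gamma} + \tfrac{1}{2}\Gamma^{2}.e^{-t\Gamma} - \Delta^{2}.\cos\{t\Delta\}\}.\alpha$$
$$\qquad + \{\tfrac{1}{2}\Gamma^{2}.e^{t\Gamma} - \tfrac{1}{2}\Gamma^{2}.e^{-t\Gamma} - \Delta^{2}.\sin\{t\Delta\}\}.\beta$$
$$\qquad = \{\Gamma^{2} - \Delta^{2}\}.\rho.$$

The constants $\alpha$ and $\beta$ are to be determined by

$$\rho_{t=0} = \alpha,$$
$$\left[\frac{d\rho}{dt}\right]_{t=0} = \{\Gamma + \Delta\}.\beta.$$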

189. It will appear, on reference to Nos. 155-157, that every
complete dyadic may be expressed in one of three forms, viz., as a
square, as a square with the negative sign, or as a difference of squares
of two dyadics of which both the direct products are equal to zero.
It follows that every equation of the form

$$\frac{d^{2}\rho}{dt^{2}} = \Theta.\rho,$$

where $\Theta$ is any constant and complete dyadic, may be integrated by
the preceding formulæ.


NOTE ON BIVECTOR ANALYSIS.* 


1. A vector is determined by three algebraic quantities. It often
occurs that the solution of the equations by which these are to be
determined gives imaginary values, i.e., instead of scalars we obtain
biscalars, or expressions of the form $a + \iota b$, where $a$ and $b$ are scalars,
and $\iota = \sqrt{-1}$. It is most simple, and always allowable, to consider
the vector as determined by its components parallel to a normal
system of axes. In other words, a vector may be represented in the
form

$$xi + yj + zk.$$

Now if the vector is required to satisfy certain conditions, the solution
of the equations which determine the values of $x$, $y$, and $z$, in the
most general case, will give results of the form

$$x = x_1 + \iota x_2,$$
$$y = y_1 + \iota y_2,$$
$$z = z_1 + \iota z_2,$$

*Thus far, in accordance with the purpose expressed in the footnote on page 17, we 
have considered only real values of scalars and vectors. The object of this limitation 
has been to present the subject in the most elementary manner. The limitation is 
however often inconvenient, and does not allow the most symmetrical and complete 
development of the subject in many important directions. Thus in Chapter V, and the 
latter part of Chapter III, the exclusion of imaginary values has involved a considerable 
sacrifice of simplicity both in the enunciation of theorems and in their demonstration. 
The student will find an interesting and profitable exercise in working over this part of 
the subject with the aid of imaginary values, especially in the discussion of the 
imaginary roots of the cubic equation on page 71, and in the use of the formula 


$$e^{\iota\Phi} = \cos\Phi + \iota\sin\Phi$$


in developing the properties of the sines, cosines, and exponentials of dyadics. 
where $x_1$, $x_2$, $y_1$, $y_2$, $z_1$, $z_2$ are scalars. Substituting these values in

$$xi + yj + zk,$$

we obtain

$$(x_1 + \iota x_2)i + (y_1 + \iota y_2)j + (z_1 + \iota z_2)k;$$

or, if we set

$$\rho_1 = x_1 i + y_1 j + z_1 k,$$
$$\rho_2 = x_2 i + y_2 j + z_2 k,$$

we obtain

$$\rho_1 + \iota\rho_2.$$

We shall call this a bivector, a term which will include a vector as a
particular case. When we wish to express a bivector by a single
letter, we shall use the small German letters. Thus we may write

$$\mathfrak{r} = \rho_1 + \iota\rho_2.$$

An important case is that in which $\rho_1$ and $\rho_2$ have the same direction.
The bivector may then be expressed in the form $(a + \iota b)\rho$, in which
the vector factor, if we choose, may be a unit vector. In this case, we
may say that the bivector has a real direction. In fact, if we express
the bivector in the form

$$(x_1 + \iota x_2)i + (y_1 + \iota y_2)j + (z_1 + \iota z_2)k$$

the ratios of the coefficients of $i$, $j$, and $k$, which determine the
direction cosines of the vector, will in this case be real.

2. The consideration that operations upon bivectors may be regarded 
as operations upon their biscalar x-, y- and z-components is sufficient 
to show the possibility of a bivector analysis and to indicate what its 
rules must be. But this point of view does not afford the most simple 
conception of the operations which we have to perform upon bivectors. 
It is desirable that the definitions of the fundamental operations should 
be independent of such extraneous considerations as any system of 
axes. 

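The various signs of our analysis, when applied to bivectors, may
therefore be defined as follows, viz.,

The bivector equation

$$\mu' + \iota\nu' = \mu'' + \iota\nu''$$

implies the two vector equations

$$\mu' = \mu'', \quad\text{and}\quad \nu' = \nu''.$$

$$-[\mu + \iota\nu] = -\mu + \iota[-\nu].$$
$$[\mu' + \iota\nu'] + [\mu'' + \iota\nu''] = [\mu' + \mu''] + \iota[\nu' + \nu''].$$
$$[\mu' + \iota\nu'].[\mu'' + \iota\nu''] = [\mu'.\mu'' - \nu'.\nu''] + \iota[\mu'.\nu'' + \nu'.\mu''].$$
$$[\mu' + \iota\nu']\times[\mu'' + \iota\nu''] = [\mu'\times\mu'' - \nu'\times\nu''] + \iota[\mu'\times\nu'' + \nu'\times\mu''].$$


* $(a + \iota b)[\mu + \iota\nu] = a\mu - b\nu + \iota[a\nu + b\mu].$
$[\mu + \iota\nu](a + \iota b) = \mu a - \nu b + \iota[\mu b + \nu a].$
Therefore the position of the scalar factor is indifferent. [MS. note by author.]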
With these definitions, a great part of the laws of vector analysis
may be applied at once to bivector expressions. But an equation 
which is impossible in vector analysis may be possible in bivector 
analysis, and in general the number of roots of an equation, or of the 
values of a function, will be different according as we recognize, 
or do not recognize, imaginary values. 

3. Def—Two bivectors, or two biscalars, are said to be conjugate,
when their real parts are the same, and their imaginary parts differ
in sign, and in sign only.

Hence, the product of the conjugates of any number of bivectors
and biscalars is the conjugate of the product of the bivectors and
biscalars. This is true of any kind of product.

The products of a bivector and its conjugate are as follows:

$$[\mu + \iota\nu].[\mu - \iota\nu] = \mu.\mu + \nu.\nu,$$
$$[\mu + \iota\nu]\times[\mu - \iota\nu] = 2\iota\,\nu\times\mu,$$
$$[\mu + \iota\nu][\mu - \iota\nu] = \{\mu\mu + \nu\nu\} + \iota\{\nu\mu - \mu\nu\}.$$

Hence, if $\mu$ and $\nu$ represent the real and imaginary parts of a
bivector, the values of

$$\mu.\mu + \nu.\nu, \qquad \mu\times\nu, \qquad \mu\mu + \nu\nu, \qquad \nu\mu - \mu\nu,$$

are not affected by multiplying the bivector by a biscalar of the form
$a + \iota b$, in which $a^2 + b^2 = 1$, say a cyclic scalar. Thus, if we set

$$\mu' + \iota\nu' = (a + \iota b)[\mu + \iota\nu],$$

we shall have

$$\mu' - \iota\nu' = (a - \iota b)[\mu - \iota\nu],$$

and

$$[\mu' + \iota\nu'].[\mu' - \iota\nu'] = [\mu + \iota\nu].[\mu - \iota\nu].$$

That is,

$$\mu'.\mu' + \nu'.\nu' = \mu.\mu + \nu.\nu;$$

and so in the other cases.
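
The definitions of Nos. 2 and 3 translate directly into arithmetic with complex components. The sketch below is an illustration only: a bivector is stored as a tuple of three Python complex numbers, the products are formed without conjugation, and all names and sample values are hypothetical.

```python
import cmath


def dot(a, b):
    """Direct product; no conjugation is applied to either factor."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Skew product, componentwise as for real vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def bivector(mu, nu):
    """mu + iota*nu from its real and imaginary parts."""
    return tuple(complex(m, n) for m, n in zip(mu, nu))


def conj(r):
    return tuple(x.conjugate() for x in r)


mu1, nu1 = (1.0, 2.0, 0.0), (0.5, -1.0, 2.0)
mu2, nu2 = (0.0, 1.0, -1.0), (2.0, 0.0, 1.0)
r, s = bivector(mu1, nu1), bivector(mu2, nu2)

# Direct product against [mu'.mu'' - nu'.nu''] + iota [mu'.nu'' + nu'.mu''].
dot_defined = complex(dot(mu1, mu2) - dot(nu1, nu2),
                      dot(mu1, nu2) + dot(nu1, mu2))
dot_gap = abs(dot(r, s) - dot_defined)

# Skew product against [mu' x mu'' - nu' x nu''] + iota [mu' x nu'' + nu' x mu''].
real_part = [p - q for p, q in zip(cross(mu1, mu2), cross(nu1, nu2))]
imag_part = [p + q for p, q in zip(cross(mu1, nu2), cross(nu1, mu2))]
cross_gap = max(abs(c - complex(p, q))
                for c, p, q in zip(cross(r, s), real_part, imag_part))

# Products of a bivector with its conjugate (No. 3).
self_dot = dot(r, conj(r))                       # mu.mu + nu.nu, a real number
self_cross = cross(r, conj(r))                   # 2 iota nu x mu
two_i_nu_x_mu = [2j * c for c in cross(nu1, mu1)]

# A cyclic scalar a + iota*b with a^2 + b^2 = 1 leaves these values unchanged.
cyclic = cmath.exp(0.7j)
r_turned = tuple(cyclic * x for x in r)
turned_dot = dot(r_turned, conj(r_turned))
turned_cross = cross(r_turned, conj(r_turned))
```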

4. Def—In biscalar analysis, the product of a biscalar and its conju- 
gate is a positive scalar. The positive square root of this scalar is 
called the modulus of the biscalar. In bivector analysis, the direct 
product of a bivector and its conjugate is, as seen above, a positive 
scalar. The positive square root of this scalar may be called the 
modulus of the bivector. When this modulus vanishes, the bivector 
vanishes, and only in this case. If the bivector is multiplied by a 
biscalar, its modulus is multiplied by the modulus of the biscalar. 
The conjugate of a (real) vector is the vector itself, and the modulus 
of the vector is the same as its magnitude. 

5. Def—If between two vectors, $\alpha$ and $\beta$, there subsists a relation
of the form

$$\alpha = n\beta,$$

where $n$ is a scalar, we say that the vectors are parallel. Analogy
leads us to call two bivectors parallel, when there subsists between
them a relation of the form

$$\mathfrak{a} = m\mathfrak{b},$$

where $m$ (in the most general case) is a biscalar.

To aid us in comprehending the geometrical signification of this
relation, we may regard the biscalar as consisting of two factors, one
of which is a positive scalar (the modulus of the biscalar), and the
other may be put in the form $\cos q + \iota\sin q$. The effect of multiplying
a bivector by a positive scalar is obvious. To understand the effect of
a multiplier of the form $\cos q + \iota\sin q$ upon a bivector $\mu + \iota\nu$, let us set

$$\mu' + \iota\nu' = (\cos q + \iota\sin q)[\mu + \iota\nu].$$

We have then

$$\mu' = \cos q\,\mu - \sin q\,\nu,$$
$$\nu' = \cos q\,\nu + \sin q\,\mu.$$

Now if $\mu$ and $\nu$ are of the same magnitude and at right angles, the
effect of the multiplication is evidently to rotate these vectors in
their plane an angular distance $q$, which is to be measured in the
direction from $\nu$ to $\mu$. In any case we may regard $\mu$ and $\nu$ as the
projections (by parallel lines) of two perpendicular vectors of the same
length. The two last equations show that $\mu'$ and $\nu'$ will be the
projections of the vectors obtained by the rotation of these
perpendicular vectors in their plane through the angle $q$. Hence, if we
construct an ellipse of which $\mu$ and $\nu$ are conjugate semi-diameters, $\mu'$
and $\nu'$ will be another pair of conjugate semi-diameters, and the sectors
between $\mu$ and $\mu'$ and between $\nu$ and $\nu'$, will each be to the whole area
of the ellipse as $q$ to $2\pi$, the sector between $\nu$ and $\nu'$ lying on the same
side of $\nu$ as $\mu$, and that between $\mu$ and $\mu'$ lying on the same side
of $\mu$ as $-\nu$.

It follows that any bivector $\mu + \iota\nu$ may be put in the form

$$(\cos q + \iota\sin q)[\alpha + \iota\beta],$$

in which $\alpha$ and $\beta$ are at right angles, being the semi-axes of the ellipse
of which $\mu$ and $\nu$ are conjugate semi-diameters. This ellipse we may
call the directional ellipse of the bivector. In the case of a real
vector, or of a vector having a real direction, it reduces to a straight
line. In any other case, the angular direction from the imaginary to
the real part of the bivector is to be regarded as positive in the ellipse,
and the specification of the ellipse must be considered incomplete
without the indication of this direction.

Parallelism of bivectors, then, signifies the similarity and similar
position of their directional ellipses. Similar position includes
identity of the angular directions mentioned above.
6. To reduce a given bivector $\mathfrak{r}$ to the above form, we may set

$$\mathfrak{r}.\mathfrak{r} = (\cos q + \iota\sin q)^{2}[\alpha + \iota\beta].[\alpha + \iota\beta]$$
$$\quad = (\cos 2q + \iota\sin 2q)(\alpha.\alpha - \beta.\beta)$$
$$\quad = a + \iota b,$$

where $a$ and $b$ are scalars, which we may regard as known. The
value of $q$ may be determined by the equation

$$\tan 2q = \frac{b}{a},$$

the quadrant to which $2q$ belongs being determined so as to give
$\sin 2q$ and $\cos 2q$ the same signs as $b$ and $a$. Then $\alpha$ and $\beta$ will be
given by the equation

$$\alpha + \iota\beta = (\cos q - \iota\sin q)\,\mathfrak{r}.$$

The solution is indeterminate when the real and imaginary parts of
the given bivector are perpendicular and equal in magnitude. In this
case the directional ellipse is a circle, and the bivector may be called
circular. The criterion of a circular bivector is

$$\mathfrak{r}.\mathfrak{r} = 0.$$

It is especially to be noticed that from this equation we cannot
conclude that

$$\mathfrak{r} = 0,$$

as in the analysis of real vectors. This may also be shown by
expressing $\mathfrak{r}$ in the form $xi + yj + zk$, in which $x$, $y$, $z$ are biscalars. The
equation then becomes

$$x^{2} + y^{2} + z^{2} = 0,$$

which evidently does not require $x$, $y$, and $z$ to vanish, as would be
the case if only real values are considered.
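
The reduction described in No. 6 can be sketched in a few lines. This is an illustration under assumptions: a bivector is stored as three complex components, `cmath.phase` is used to place $2q$ in the quadrant fixed by the signs of $b$ and $a$, and the function name `reduce_bivector` and the sample values are hypothetical.

```python
import cmath


def dot(a, b):
    """Direct product of two bivectors (no conjugation)."""
    return sum(x * y for x, y in zip(a, b))


def reduce_bivector(r):
    """Return q, alpha, beta with r = (cos q + iota sin q)[alpha + iota beta].

    The result is arbitrary when r.r = 0 (a circular bivector), for which
    the text calls the solution indeterminate.
    """
    s = dot(r, r)                          # r.r = a + iota*b
    q = cmath.phase(s) / 2.0               # 2q has the signs of b and a
    factor = cmath.exp(-1j * q)            # cos q - iota sin q
    reduced = [factor * x for x in r]
    alpha = [x.real for x in reduced]
    beta = [x.imag for x in reduced]
    return q, alpha, beta


# Hypothetical bivector mu + iota*nu with mu = (1, 2, 0), nu = (0.5, -1, 2).
r = (1 + 0.5j, 2 - 1j, 0 + 2j)
q, alpha, beta = reduce_bivector(r)

# alpha and beta are at right angles, and alpha is the longer semi-axis.
axes_dot = dot(alpha, beta)
rebuilt = [cmath.exp(1j * q) * complex(a, b) for a, b in zip(alpha, beta)]
rebuild_gap = max(abs(x - y) for x, y in zip(rebuilt, r))

# A circular bivector: real and imaginary parts perpendicular and equal.
circular = (1 + 0j, 1j, 0j)
circular_square = dot(circular, circular)  # vanishes although circular != 0
```

Choosing $2q$ as the phase of $\mathfrak{r}.\mathfrak{r}$ makes $\alpha.\alpha - \beta.\beta$ non-negative, so that `alpha` comes out as the major semi-axis of the directional ellipse.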

7. Def—We call two vectors $\rho$ and $\sigma$ perpendicular when $\rho.\sigma = 0$.
Following the same analogy, we shall call two bivectors $\mathfrak{r}$ and $\mathfrak{s}$
perpendicular, when

$$\mathfrak{r}.\mathfrak{s} = 0.$$

In considering the geometrical signification of this equation, we shall
first suppose that the real and imaginary components of $\mathfrak{r}$ and $\mathfrak{s}$ lie in
the same plane, and that both $\mathfrak{r}$ and $\mathfrak{s}$ have not real directions. It is
then evidently possible to express them in the form

$$m[\alpha + \iota\beta], \qquad m'[\alpha' + \iota\beta'],$$

where $m$ and $m'$ are biscalar, $\alpha$ and $\beta$ are at right angles, and $\alpha'$
parallel with $\beta$. Then the equation $\mathfrak{r}.\mathfrak{s} = 0$ requires that

$$\beta.\beta' = 0, \quad\text{and}\quad \alpha.\beta' + \beta.\alpha' = 0.$$

This shows that the directional ellipses of the two bivectors are
similar and the angular direction from the real to the imaginary
component is the same in both, but the major axes of the ellipses are
perpendicular. The case in which the directions of $\mathfrak{r}$ and $\mathfrak{s}$ are real,
forms no exception to this rule.

It will be observed that every circular bivector is perpendicular to
itself, and to every parallel bivector.

If two bivectors, $\mu + \iota\nu$, $\mu' + \iota\nu'$, which do not lie in the same plane
are perpendicular, we may resolve $\mu$ and $\nu$ into components parallel
and perpendicular to the plane of $\mu'$ and $\nu'$. The components
perpendicular to the plane evidently contribute nothing to the value of

$$[\mu + \iota\nu].[\mu' + \iota\nu'].$$

Therefore the components of $\mu$ and $\nu$ parallel to the plane of $\mu'$, $\nu'$,
form a bivector which is perpendicular to $\mu' + \iota\nu'$. That is, if two
bivectors are perpendicular, the directional ellipse of either, projected
upon the plane of the other and rotated through a quadrant in that
plane, will be similar and similarly situated to the directional ellipse
of the second.

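8. A bivector may be divided in one and only one way into parts
parallel and perpendicular to another, provided that the second is not
circular. If $\mathfrak{a}$ and $\mathfrak{b}$ are the bivectors, the parts of $\mathfrak{a}$ will be

$$\frac{\mathfrak{a}.\mathfrak{b}}{\mathfrak{b}.\mathfrak{b}}\,\mathfrak{b} \quad\text{and}\quad \mathfrak{a} - \frac{\mathfrak{a}.\mathfrak{b}}{\mathfrak{b}.\mathfrak{b}}\,\mathfrak{b}.$$

If $\mathfrak{b}$ is circular, the resolution of $\mathfrak{a}$ is impossible, unless it is
perpendicular to $\mathfrak{b}$. In this case the resolution is indeterminate.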
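
The resolution of No. 8 may be sketched as follows, assuming bivectors stored as three complex components. The function name `resolve`, the tolerance, and the sample values are hypothetical; the circular case is rejected rather than handled.

```python
def dot(a, b):
    """Direct product of two bivectors (no conjugation)."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def resolve(a, b, tol=1e-12):
    """Split a into parts parallel and perpendicular to b."""
    bb = dot(b, b)
    if abs(bb) < tol:
        # b.b = 0 marks a circular bivector, for which no unique split exists.
        raise ValueError("second bivector is circular")
    coefficient = dot(a, b) / bb           # a biscalar in general
    parallel = tuple(coefficient * x for x in b)
    perpendicular = tuple(x - y for x, y in zip(a, parallel))
    return parallel, perpendicular


# Hypothetical sample bivectors.
a = (1 + 2j, 0.5j, -1 + 0j)
b = (2 + 0j, 1 - 1j, 0.5j)
parallel, perpendicular = resolve(a, b)

perp_check = abs(dot(perpendicular, b))                   # should vanish
par_check = max(abs(c) for c in cross(parallel, b))       # parallel part x b = 0
sum_gap = max(abs(p + q - x) for p, q, x in zip(parallel, perpendicular, a))

try:
    resolve(a, (1 + 0j, 1j, 0j))           # a circular second bivector
    circular_rejected = False
except ValueError:
    circular_rejected = True
```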

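9. Since $\mathfrak{a}\times\mathfrak{b}.\mathfrak{a} = 0$, and $\mathfrak{a}\times\mathfrak{b}.\mathfrak{b} = 0$, $\mathfrak{a}\times\mathfrak{b}$ is perpendicular to $\mathfrak{a}$ and $\mathfrak{b}$.
We may regard the plane of the product as determined by the
condition that the directional ellipses of the factors projected upon
it become similar and similarly situated. The directional ellipse of
the product is similar to these projections, but its orientation is
different by 90°. It may easily be shown that $\mathfrak{a}\times\mathfrak{b}$ vanishes only
with $\mathfrak{a}$ or $\mathfrak{b}$, or when $\mathfrak{a}$ and $\mathfrak{b}$ are parallel.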

10. The bivector equation

$$(\mathfrak{a}\times\mathfrak{b}.\mathfrak{c})\mathfrak{d} - (\mathfrak{b}.\mathfrak{c}\times\mathfrak{d})\mathfrak{a} + (\mathfrak{c}.\mathfrak{d}\times\mathfrak{a})\mathfrak{b} - (\mathfrak{d}.\mathfrak{a}\times\mathfrak{b})\mathfrak{c} = 0$$

is identical, as may be verified by substituting expressions of the
form $xi + yj + zk$ ($x$, $y$, $z$ being biscalars), for each of the bivectors.
(Compare No. 37.) This equation shows that if the product $\mathfrak{a}\times\mathfrak{b}$
of any two bivectors vanishes, one of these will be equal to the other
with a biscalar coefficient, that is, they will be parallel, according
to the definition given above. If the product $\mathfrak{a}.\mathfrak{b}\times\mathfrak{c}$ of any three
bivectors vanishes, the equation shows that one of these may be
expressed as a sum of the other two with biscalar coefficients. In
this case, we may say (from the analogy of the scalar analysis) that
the three bivectors are complanar. (This does not imply that they
lie in any same real plane.) If $\mathfrak{a}.\mathfrak{b}\times\mathfrak{c}$ is not equal to zero, the
equation shows that any fourth bivector may be expressed as a sum
of $\mathfrak{a}$, $\mathfrak{b}$, and $\mathfrak{c}$ with biscalar coefficients, and indicates how these
coefficients may be determined.

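11. The equation

$$(\mathfrak{r}.\mathfrak{a})\mathfrak{b}\times\mathfrak{c} + (\mathfrak{r}.\mathfrak{b})\mathfrak{c}\times\mathfrak{a} + (\mathfrak{r}.\mathfrak{c})\mathfrak{a}\times\mathfrak{b} = (\mathfrak{a}\times\mathfrak{b}.\mathfrak{c})\mathfrak{r}$$

is also identical, as may easily be verified. If we set

$$\mathfrak{c} = \mathfrak{a}\times\mathfrak{b},$$

and suppose that

$$\mathfrak{r}.\mathfrak{a} = 0, \qquad \mathfrak{r}.\mathfrak{b} = 0,$$

the equation becomes

$$(\mathfrak{r}.\mathfrak{a}\times\mathfrak{b})\,\mathfrak{a}\times\mathfrak{b} = (\mathfrak{a}\times\mathfrak{b}.\mathfrak{a}\times\mathfrak{b})\,\mathfrak{r}.$$

This shows that if a bivector $\mathfrak{r}$ is perpendicular to two bivectors
$\mathfrak{a}$ and $\mathfrak{b}$, which are not parallel, $\mathfrak{r}$ will be parallel to $\mathfrak{a}\times\mathfrak{b}$. Therefore
all bivectors which are perpendicular to two given bivectors are
parallel to each other, unless the given two are parallel.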


[Note by Editors.—The notation $|\Phi|\,\Phi_C^{-1}$, used on page 64, was later improved by the
author by the introduction of his Double Multiplication, according to which the above
expression is represented by $\Phi_2$, and $|\Phi|$ by $\Phi_3$. See this volume, pages 112, 160,
and 181. For an extended treatment of Professor Gibbs’s researches on Double
Multiplication in their application to Vector Analysis see pp. 306-321, and 333 of “Vector
Analysis,” by E. B. Wilson, Chas. Scribner’s Sons, New York, 1901.]
ON MULTIPLE ALGEBRA.


ADDRESS BEFORE THE SECTION OF MATHEMATICS AND ASTRONOMY OF THE AMERICAN
ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE, BY THE VICE-PRESIDENT.


[Proceedings of the American Association for the Advancement of Science, 
vol. XXXV. pp. 37-66, 1886.] 


It has been said that “the human mind has never invented a labor-
saving machine equal to algebra.”* If this be true, it is but natural 
and proper that an age like our own, characterized by the multi- 
plication of labor-saving machinery, should be distinguished by an 
unexampled development of this most refined and most beautiful of 
machines. That such has been the case, none will question. The 
improvement has been in every part. Even to enumerate the principal 
lines of advance would be a task for any one; for me an impossibility. 
But if we should ask, in what direction the advance has been made 
which is to characterize the development of algebra in our day, we 
may, I think, point to that broadening of its field and methods which 
gives us multiple algebra. 

Of the importance of this change in the conception of the office of 
algebra, it is hardly necessary to speak : that it is really characteristic 
of our time will be most evident if we go back some two or three- 
score years, to the time when the seeds were sown which are now 
yielding so abundant a harvest. The failure of Möbius, Hamilton,
Grassmann, Saint-Venant to make an immediate impression upon
the course of mathematical thought in any way commensurate with
the importance of their discoveries is the most conspicuous evidence
that the times were not ripe for the methods which they sought to
introduce. A satisfactory theory of the imaginary quantities of
ordinary algebra, which is essentially a simple case of multiple
algebra, with difficulty obtained recognition in the first third of this
century. We must observe that this double algebra, as it has been
called, was not sought for or invented ;—it forced itself, unbidden, 
upon the attention of mathematicians, and with its rules already 
formed. 


* The Nation, vol. xxxiii, p. 237. 
But the idea of double algebra, once received, although as it were
unwillingly, must have suggested to many minds more or less 
distinctly the possibility of other multiple algebras, of higher orders, 
possessing interesting or useful properties. 

The application of double algebra to the geometry of the plane 
suggested not unnaturally to Hamilton the idea of a triple algebra 
which should be capable of a similar application to the geometry of 
three dimensions. He was unable to find a satisfactory triple algebra, 
but discovered at length a quadruple algebra, quaternions, which 
answered his purpose, thus satisfying, as he says in one of his letters, 
an intellectual want which had haunted him at least fifteen years. 
So confident was he of the value of this algebra, that the same hour 
he obtained permission to lay his discovery before the Royal Irish 
Academy, which he did on November 13, 1843.* This system of 
multiple algebra is far better known than any other, except the 
ordinary double algebra of imaginary quantities, far too well known
to require any especial notice at my hands. All that here requires 
our attention is the close historical connection between the imaginaries 
of ordinary algebra and Hamilton’s system, a fact emphasized by 
Hamilton himself and most writers on quaternions. It was quite 
otherwise with Möbius and Grassmann.

The point of departure of the Barycentrischer Calcul of Möbius, 
published in 1827 (a work of which Clebsch has said that it can 
never be admired enough†), is the use of equations in which the 
terms consist of letters representing points with numerical coefficients, 
to express barycentric relations between the points. Thus, that the 
point S is the centre of gravity of weights a, b, c, d, placed at the 
points A, B, C, D, respectively, is expressed by the equation 


(a+b+c+d)S=aA+bB+cC+dD. 
An equation of the more general form 
aA+bB+cC+etc.=pP+qQ+rR+etc. 


signifies that the weights a, b, c, etc., at the points A, B, C, etc., have 
the same sum and the same centre of gravity as the weights p, q, r, 
etc., at the points P, Q, R, etc., or, in other words, that the former are 
barycentrically equivalent to the latter. Such equations, of which 
each represents four ordinary equations, may evidently be multiplied 
or divided by scalars,‡ may be added or subtracted, and may have 


* Phil. Mag. (3), vol. xxv, p. 490; North British Review, vol. xlv (1866), p. 57. 

† See his eulogy on Plücker, p. 14, Gött. Abhandl., vol. xvi. 

‡ I use this term in Hamilton’s sense, to denote the ordinary positive and negative 
quantities of algebra. It may, however, be observed that in most cases in which I shall 
have occasion to use it, the proposition would hold without exclusion of imaginary 
quantities,—that this exclusion is generally for simplicity and not from necessity. 


their terms arranged and transposed, exactly like the ordinary 
equations of algebra. It follows that the elimination of letters 
representing points from equations of this kind is performed by the 
rules of ordinary algebra. This is evidently the beginning of a 
quadruple algebra, and is identical, as far as it goes, with Grassmann’s 
marvellous geometrical algebra. 
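
As a modern illustration, which is no part of the original text, the barycentric equations above may be sketched in Python. The function names and the sample points and weights are assumptions chosen for the example. A point is taken to be a tuple of three coordinates, so that each barycentric equation unpacks into four ordinary ones: one for the total weight and three for the coordinates.

```python
from fractions import Fraction

def weighted_sum(weights, points):
    """Return (total weight, first moment) of weights placed at points.

    These four numbers are the "four ordinary equations" carried by one
    barycentric expression aA + bB + cC + ...
    """
    total = sum(weights)
    dim = len(points[0])
    moment = tuple(sum(w * p[i] for w, p in zip(weights, points))
                   for i in range(dim))
    return total, moment

def centre_of_gravity(weights, points):
    """Solve (a + b + c + ...)S = aA + bB + cC + ... for the point S."""
    total, moment = weighted_sum(weights, points)
    return tuple(Fraction(m, total) for m in moment)

def barycentrically_equivalent(weights1, points1, weights2, points2):
    """True when both systems have the same sum and the same centre of gravity."""
    return weighted_sum(weights1, points1) == weighted_sum(weights2, points2)

# Hypothetical data: weights 1, 2, 3, 4 at four points of space.
A, B, C, D = (0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)
S = centre_of_gravity([1, 2, 3, 4], [A, B, C, D])
print(S)
```

The single weight 10 placed at S is then barycentrically equivalent to the four original weights, which is what the first equation of the text asserts.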

In the same work we find also, for the first time so far as I am 
aware, the distinction of positive and negative consistently carried 
out on the designation of segments of lines, of triangles, and of tetra- 
hedra, viz., that a change in place of two letters, in such expressions 
as AB, ABC, ABCD, is equivalent to prefixing the negative sign. It 
is impossible to overestimate the importance of this step, which gives 
to designations of this kind the generality and precision of algebra. 

Moreover, if A, B, C are three points in the same straight line, and 
D any point outside of that line, the author observes that we have 


AB+BC+CA=0, 
and also, with D prefixed, 
DAB+DBC+DCA=0. 


Again, if A, B, C, D are four points in the same plane, and E any 
point outside of that plane, we have 


ABC−BCD+CDA−DAB=0, 
and also, with E prefixed, 
EABC−EBCD+ECDA−EDAB=0. 
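
The sign conventions for segments and triangles can be checked numerically. The following is a minimal sketch, assuming that a segment on a line is measured by the difference of two coordinates and that a triangle is measured by twice its signed area; the helper names and the sample points are hypothetical.

```python
def signed_segment(p, q):
    """Signed length of the segment PQ on a line; PQ = -QP."""
    return q - p

def signed_area(P, Q, R):
    """Twice the signed area of triangle PQR; swapping two letters flips the sign."""
    return (Q[0] - P[0]) * (R[1] - P[1]) - (Q[1] - P[1]) * (R[0] - P[0])

# Three points on one line: AB + BC + CA = 0.
a, b, c = 2, 7, -3
segment_sum = signed_segment(a, b) + signed_segment(b, c) + signed_segment(c, a)

# Four points in one plane: ABC - BCD + CDA - DAB = 0.
A, B, C, D = (0, 0), (4, 1), (3, 5), (-2, 2)
area_sum = (signed_area(A, B, C) - signed_area(B, C, D)
            + signed_area(C, D, A) - signed_area(D, A, B))

print(segment_sum, area_sum)
```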


The similarity to multiplication in the derivation of these formule 
cannot have escaped the author's notice. Yet he does not seem to 
have been able to generalize these processes. It was reserved for the 
genius of Grassmann to see that AB might be regarded as the 
product of A and B, DAB as the product of D and AB, and EABC 
as the product of E and ABC. That Möbius could not make this step 
was evidently due to the fact that he had not the conception of the 
addition of other multiple quantities than such as may be represented 
by masses situated at points. Even the addition of vectors (i.e., the 
fact that the composition of directed lines could be treated as an 
addition) seems to have been unknown to him at this time, although 
he subsequently discovered it, and used it in his Mechanik des 
Himmels, which was published in 1843. This addition of vectors, 
or geometrical addition, seems to have occurred independently to 
many persons. 

Seventeen years after the Barycentrischer Calcul, in 1844, the year 
in which Hamilton’s first papers on quaternions appeared in print, 
Grassmann published his Lineale Ausdehnungslehre, in which he 
developed the idea and the properties of the external or combinatorial 
product, a conception which is perhaps to be regarded as the greatest 
monument of the author’s genius. This volume was to have been 
followed by another, of the nature of which some intimation was 
given in the preface and in the work itself. We are especially told 
that the internal product,* which for vectors is identical except in 
sign with the scalar part of Hamilton’s product (just as Grassmann’s 
external product of two vectors is practically identical with the 
vector part of Hamilton’s product), and the open product,† which in 
the language of to-day would be called a matrix, were to be treated 
in the second volume. But both the internal product of vectors and 
the open product are clearly defined, and their fundamental properties 
indicated, in this first volume. 

This remarkable work remained unnoticed for more than twenty 
years, a fact which was doubtless due in part to the very abstract and 
philosophical manner in which the subject was presented. In con- 
sequence of this neglect the author changed his plan, and instead of a 
supplementary volume published in 1862 a single volume entitled 
Ausdehnungslehre, in which were treated, in an entirely different 
style, the same topics as in the first volume, as well as those which he 
had reserved for the second. 

Deferring for the moment the discussion of these topics in order to 
follow the course of events, we find in the year following the first 
Ausdehnungslehre a remarkable memoir of Saint-Venant,‡ in which 
are clearly described the addition both of vectors and of oriented 
areas, the differentiation of these with respect to a scalar quantity, 
and a multiplication of two vectors and of a vector and an oriented 
area. These multiplications, called by the author geometrical, are 
entirely identical with Grassmann’s external multiplication of the 
same quantities. 

It is a striking fact in the history of the subject, that the short 
period of less than two years was marked by the appearance of 
well-developed and valuable systems of multiple algebra by British, 
German, and French authors, working apparently entirely inde- 
pendently of one another. No system of multiple algebra had 
appeared before, so far as I know, except such as were confined to 
additive processes with multiplication by scalars, or related to the 
ordinary double algebra of imaginary quantities. But the appearance 
of a single one of these systems would have been sufficient to mark 
an epoch, perhaps the most important epoch in the history of the 
subject. 

In 1853 and 1854, Cauchy published several memoirs on what 
he called clefs algébriques.§ These were units subject generally to 

* See the preface. † See § 172. 

‡ Comptes Rendus, vol. xxi, p. 620. § Comptes Rendus, vols. xxxvi, ff. 
combinatorial multiplication. His principal application was to the 
theory of elimination. In this application, as in the law of multi- 
plication, he had been anticipated by Grassmann. 

We come next to Cayley’s celebrated Memoir on the Theory of 
Matrices* in 1858, of which Sylvester has said that it seems to him 
to have ushered in the reign of Algebra the Second.† I quote this 
dictum of a master as showing his opinion of the importance of the 
subject and of the memoir. But the foundations of the theory of 
matrices, regarded as multiple quantities, seem to me to have been 
already laid in the Ausdehnungslehre of 1844. To Grassmann’s 
treatment of this subject we shall recur later. 

After the Ausdehnungslehre of 1862, already mentioned, we come 
to Hankel’s Vorlesungen über die complexen Zahlen, 1867. Under 
this title the author treats of the imaginary quantities of ordinary 
algebra, of what he calls alternirende Zahlen, and of quaternions. 
These alternate numbers, like Cauchy’s clefs, are quantities subject 
to Grassmann’s law of combinatorial multiplication. This treatise, 
published twenty-three years after the first Ausdehnungslehre, marks 
the first impression which we can discover of Grassmann’s ideas upon 
the course of mathematical thought. The transcendent importance 
of these ideas was fully appreciated by the author, whose very able 
work seems to have had considerable influence in calling the attention 
of mathematicians to the subject. 

In 1870, Professor Benjamin Peirce published his Linear Associative 
Algebra, subsequently developed and enriched by his son, Professor 
C. S. Peirce. The fact that the edition was lithographed seems to 
indicate that even at this late date a work of this kind could only 
be regarded as addressed to a limited number of readers. But the 
increasing interest in such subjects is shown by the republication of 
this memoir in 1881,‡ as by that of the first Ausdehnungslehre in 
1878. 

The article on quaternions which has just appeared in the Encyclo- 
pedia Britannica mentions twelve treatises, including second editions 
and translations, besides the original treatises of Hamilton. That all 
the twelve are later than 1861 and all but two later than 1872 shows 
the rapid increase of interest in this subject in the last years. 

Finally, we arrive at the Lectures on the Principles of Universal 
Algebra by the distinguished foreigner whose sojourn among us has 
given such an impulse to mathematical study in this country. The 
publication of these lectures, commenced in 1884 in the American 
Journal of Mathematics, has not as yet been completed,—a want but 
imperfectly supplied by the author's somewhat desultory publication 

* Phil. Trans., vol. cxlviii. † Amer. Journ. Math., vol. vi, p. 271. 

‡ Amer. Journ. Math., vol. iv. 




of many remarkable papers on the same subject (which might be 
more definitely expressed as the algebra of matrices) in various 
foreign journals. 


It is not an accident that this century has seen the rise of multiple 
algebra. The course of the development of ideas in algebra and in 
geometry, although in the main independent of any aid from this 
source, has nevertheless to a very large extent been of a character 
which can only find its natural expression in multiple algebra. 

Our Modern Higher Algebra is especially occupied with the theory 
of linear transformations. Now what are the first notions which we 
meet in this theory? We have a set of n variables, say x, y, z, and 
another set, say x′, y′, z′, which are homogeneous linear functions of 
the first, and therefore expressible in terms of them by means of a 
block of n² coefficients. Here the quantities occur by sets, and invite 
the notations of multiple algebra. It was in fact shown by Grassmann 
in his first Ausdehnungslehre and by Cauchy nine years later, 
that the notations of multiple algebra afford a natural key to the 
subject of elimination. 

Now I do not merely mean that we may save a little time or space 
by writing perhaps ρ for x, y and z; ρ′ for x′, y′ and z′; and Φ for a 
block of n² quantities. But I mean that the subject as usually treated 
under the title of determinants has a stunted and misdirected develop- 
ment on account of the limitations of single algebra. This will appear 
from a very simple illustration. After a little preliminary matter 
the student comes generally to a chapter entitled “ Multiplication of 
Determinants,” in which he is taught that the product of the deter- 
minants of two matrices may be found by performing a somewhat 
lengthy operation on the two matrices, by which he obtains a third 
matrix, and then taking the determinant of this. But what signifi- 
cance, what value has this theorem? For aught that appears in the 
majority of treatises which I have seen, we have only a complicated 
and lengthy way of performing a simple operation. The real facts 
of the case may be stated as follows: 

Suppose the set of n quantities ρ′ to be derived from the set ρ by 
the matrix Φ, which we may express by 


ρ′ = Φ.ρ; 

and suppose the set ρ″ to be derived from the set ρ′ by the matrix Ψ, i.e., 

ρ″ = Ψ.ρ′, 

and ρ″ = Ψ.Φ.ρ; 


it is evident that ρ″ can be derived from ρ by the operation of a 
single matrix, say Θ, i.e., 

ρ″ = Θ.ρ, 
so that Θ = Ψ.Φ. 




In the language of multiple algebra Θ is called the product of 
Ψ and Φ. It is of course interesting to see how it is derived from 
the latter, and it is little more than a schoolboy’s exercise to determine 
this. Now this matrix Θ has the property that its determinant 
is equal to the product of the determinants of Ψ and Φ. And this 
property is all that is generally stated in the books, and the fundamental 
property, which is all that gives the subject its interest, that Θ 
is itself the product of Ψ and Φ in the language of multiple algebra, 
i.e., that operating by Θ is equivalent to operating successively by 
Φ and Ψ, is generally omitted. The chapter on this subject, in most 
treatises which I have seen, reads very like the play of Hamlet with 
Hamlet's part left out. 
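
Both properties may be verified side by side in a short Python sketch, offered as a modern illustration and not as anything found in the treatises discussed. The helper names (`apply`, `product`, `det`) and the sample matrices are assumptions; matrices are plain lists of rows with integer entries, so the comparisons are exact.

```python
def apply(matrix, vector):
    """Operate on a set of n quantities (a list) by an n x n matrix (a list of rows)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

def product(psi, phi):
    """Return Theta = Psi.Phi: operating by Theta is operating by Phi, then by Psi."""
    n = len(phi)
    return [[sum(psi[i][k] * phi[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def det(matrix):
    """Determinant by expansion along the first row (adequate for small n)."""
    if len(matrix) == 1:
        return matrix[0][0]
    total = 0
    for j, entry in enumerate(matrix[0]):
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

# Hypothetical 3 x 3 matrices and operand, chosen only for illustration.
Phi = [[1, 2, 0], [0, 1, 3], [4, 0, 1]]
Psi = [[2, 0, 1], [1, 1, 0], [0, 3, 1]]
rho = [1, -2, 3]

Theta = product(Psi, Phi)
rho_2 = apply(Psi, apply(Phi, rho))        # rho'' obtained in two steps

print(apply(Theta, rho) == rho_2)          # the fundamental property
print(det(Theta) == det(Psi) * det(Phi))   # the property stated in the books
```

The order of the factors matters: for these sample matrices `product(Phi, Psi)` differs from `Theta`, although the two products have the same determinant.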

And what is the cause of this omission? Certainly not ignorance 
of the property in question. The fact that it is occasionally 
given would be a sufficient bar to this answer. It is because 
the author fails to see that his real subject is matrices and not 
determinants. Of course, in a certain sense, the author has a right 
to choose his subject. But this does not mean that the choice 
is unimportant, or that it should be determined by chance or by 
caprice. The problem well put is half solved, as we all know. 
If one chooses the subject ill, it will develop itself in a cramped 
manner. 

But the case is really much worse than I have stated it. Not only 
is the true significance of the formation of Θ from Ψ and Φ not given, 
but the student is often not taught to form the matrix which is the 
product of Ψ and Φ, but one which is the product of one of these 
matrices and the conjugate of the other. Thus the proposition which 
is proved loses all its simplicity and significance, and must be recast 
before the instructor can explain its true bearings to the student. 
This fault has been denounced by Sylvester, and if anyone thinks 
I make too much of the standpoint from which the subject is viewed, 
I will refer him to the opening paragraphs of the “Lectures on 
Universal Algebra” in the sixth volume of the American Journal of 
Mathematics, where, with a wealth of illustration and an energy 
of diction which I cannot emulate, the most eloquent of mathe- 
maticians expresses his sense of the importance of the substitution 
of the idea of the matrix for that of the determinant. If then so 
important, why was the idea of the matrix let slip? Of course the 
writers on this subject had it to commence with. One cannot even 
define a determinant without the idea of a matrix. The simple fact is 
that in general the writers on this subject have especially developed 
those ideas which are naturally expressed in simple algebra, and have 
postponed or slurred over or omitted altogether those ideas which 
find their natural expression in multiple algebra. But in this subject 

the latter happen to be the fundamental ideas, and those which ought 
to direct the whole course of thought. 

I have taken a very simple illustration, perhaps the very first 
theorem which meets the student after those immediately connected 
with the introductory definitions, both because the simplest illustration 
is really the best, and because I am here most at home. But the 
principles of multiple algebra seem to me to shed a flood of light into 
every corner of the subjects usually treated under the title of deter- 
minants, the subject gaining as much in breadth from the new notions 
as in simplicity from the new notations; and in the more intricate 
subjects of invariants, covariants, etc., I believe that the principles of 
multiple algebra are ready to perform an equal service. Certainly 
they make many things seem very simple to me which I should 
otherwise find difficult of comprehension. 

Let us turn to geometry. 

If we were asked to characterize in a single word our modern 
geometry, we would perhaps say that it is a geometry of position. 
Now position is essentially a multiple quantity, or, if you prefer, is 
naturally represented in algebra by a multiple quantity. And the 
growth in this century of the so-called synthetic as opposed to 
analytical geometry seems due to the fact that by the ordinary 
analysis geometers could not easily express, except in a cumbersome 
and unnatural manner, the sort of relations in which they were par- 
ticularly interested. With the introduction of the notations of multiple 
algebra, this difficulty falls away, and with it the opposition between 
synthetic and analytical geometry. 

It is, however, interesting and very instructive to observe how the 
ingenuity of mathematicians has often triumphed over the limitations 
of ordinary algebra. A conspicuous example and one of the simplest 
is seen in the Mécanique Analytique, where the author, by the use 
of what are sometimes called indeterminate equations, is able to write 
in one equation the equivalent of an indefinite number. Thus the 
equation 


X dx + Y dy + Z dz = 0, 
by the indeterminateness of the values of dx, dy, dz, is made equivalent 
to the three equations 

X = 0, Y = 0, Z = 0. 
It is instructive to compare this with 

Xi + Yj + Zk = 0, 
which is the form that Hamilton or Grassmann would have used. 
The use of this analytical artifice, if such it can be called, runs all 
through the work and is fairly characteristic of it. 
Again, the introduction of the potential in the theory of gravity, or 
electricity, or magnetism, gives us a scalar quantity instead of a vector 
as the subject of study; and in mechanics generally the use of the 
force-function substitutes a simple quantity for a complex. This 
method is in reality not different from that just mentioned, since 
Lagrange’s indeterminate equation expresses, at least in its origin, 
the variation of the force-function. It is indeed the real beauty of 
Lagrange’s method that it is not so much an analytical artifice, as the 
natural development of the subject. 

In modern analytical geometry we find methods in use which are 
exceedingly ingenious, and give forms curiously like those of multiple 
algebra, but which, at least if logically carried out very far, are exces- 
sively artificial, and that for the expression of the simplest things. 
The simplest conceptions of the geometry of three dimensions are 
points and planes, and the simplest relation between these is that a 
point lies in a plane. Let us see how these notions have been handled 
by means of ordinary algebra, and by multiple algebra. It will illus- 
trate the characteristic difference of the methods, perhaps as well as 
the reading of an elaborate treatise. 

In multiple algebra a point is designated by a single letter, just as 
it is in what is called synthetic geometry, and as it generally is by the 
ordinary analyst when he is not writing equations. But in his 
equations, instead of a single letter the analyst introduces several 
letters (coordinates) to represent the point. 

A plane may be represented in multiple algebra as in synthetic 
geometry by a single letter; in the ordinary algebra it is sometimes 
represented by three coordinates, for which it is most convenient to 
take the reciprocals of the segments cut off by the plane on three axes. 
But the modern analyst has a more ingenious method of representing 
the plane. He observes that the equation of the plane may be written 


ξx + ηy + ζz = 1, (1) 
where ξ, η, ζ are the reciprocals of the segments, and x, y, z are the 
coordinates of any point in the plane. Now if we set 


p = ξx + ηy + ζz, (2) 
this letter will represent an expression which represents the plane. In 
fact, we may say that p implicitly contains ξ, η, and ζ, which are the 
coordinates of the plane. We may therefore speak of the plane p, and 
for many purposes can introduce the letter p into our equations instead 
of ξ, η, ζ. For example, the equation 


p‴ = (p′ + p″)/2 (3) 
is equivalent to the three equations 

ξ‴ = (ξ′ + ξ″)/2, η‴ = (η′ + η″)/2, ζ‴ = (ζ′ + ζ″)/2. (4) 
It is to be noticed that on account of the indeterminateness of the 
x, y, and z, this method, regarded as an analytical artifice, is identical 
with that of Lagrange, also that in multiple algebra we should have 
an equation of precisely the same form as (3) to express the same 
relation between the planes, but that the equation would be explained 
to the student in a totally different manner. This we shall see more 
particularly hereafter. 

It is curious that we have thus a simpler notation for a plane than 
for a point. This however may be reversed. If we commence with 
the notion of the coordinates of a plane, ξ, η, ζ, the equation of a 
point (i.e., the equation between ξ, η, ζ which will hold for every 
plane passing through the point) will be 


xξ + yη + zζ = 1, (5) 
where x, y, z are the coordinates of the point. Now if we set 

q = xξ + yη + zζ, (6) 


we may regard the single letter q as representing the point, and use 
it, in many cases, instead of the coordinates x, y, z, which indeed it 
implicitly contains. Thus we may write 

q‴ = (q′ + q″)/2 (7) 
for the three equations 


x‴ = (x′ + x″)/2, y‴ = (y′ + y″)/2, z‴ = (z′ + z″)/2. (8) 


Here, by an analytical artifice, we come to equations identical in 
form and meaning with those used by Hamilton, Grassmann, and even 
by Möbius in 1827. But the explanations of the formulæ would 
differ widely. The methods of the founders of multiple algebra are 
characterized by a bold simplicity, that of the modern geometry by a 
somewhat bewildering ingenuity. That p and q represent the same 
expression (in one case x, y, z, and in the other ξ, η, ζ being indeterminate) 
is a circumstance which may easily become perplexing. I am 
not quite certain that it would be convenient to use both of these 
abridged notations at the same time. In fact, if the geometer using 
these methods were asked to express by an equation in p and q that 
the point q lies in the plane p, he might find himself somewhat 
entangled in the meshes of his own ingenuity, and need some new 
artifice to extricate himself. I do not mean that his genius might 
not possibly be equal to the occasion, but I do mean very seriously 
that it is a vicious method which requires any ingenuity or any 
artifice to express so simple a relation. 

If we use the methods of multiple algebra which are most com- 
parable to those just described, a point is naturally represented by a 
vector (ρ) drawn to it from the origin, a plane by a vector (σ) drawn 
from the origin perpendicularly toward the plane and in length equal 
to the reciprocal of the distance of the plane from the origin. The 
equation 

σ‴ = (σ′ + σ″)/2 (9) 


will have precisely the same meaning as equation (3), and 


ρ‴ = (ρ′ + ρ″)/2 (10) 


will have precisely the same meaning as equation (7), viz., that the 
point ρ‴ is in the middle between ρ′ and ρ″. That the point ρ lies in 
the plane σ is expressed by equating to unity the product of ρ and σ 
called by Grassmann internal, or by Hamilton called the scalar part 
of the product taken negatively. By whatever name called, the 
quantity in question is the product of the lengths of the vectors and 
the cosine of the included angle. It is of course immaterial what 
particular sign we use to express this product, as whether we write 

ρ.σ = 1, or Sρσ = −1. (11) 
I should myself prefer the simplest possible sign for so simple a 
relation. It may be observed that ρ and σ may be expressed as the 
geometrical sum of their components parallel to a set of perpendicular 
axes, viz., 
ρ = xi + yj + zk, σ = ξi + ηj + ζk. (12) 
By substitution of these values, equation (11) becomes by the laws of 
this kind of multiplication 
xξ + yη + zζ = 1. (13) 
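
Equations (10) to (13) lend themselves to a small numerical check. What follows is a minimal sketch under stated assumptions: vectors are tuples of components along perpendicular axes, the internal product is computed as the sum of products of like components, and the plane is a hypothetical one cutting the axes at 2, 3 and 6, so that the components of σ are the reciprocals 1/2, 1/3 and 1/6. The names `dot`, `midpoint` and `foot` are illustrative only.

```python
from fractions import Fraction

def dot(u, v):
    """Internal product of two vectors given by their rectangular components."""
    return sum(a * b for a, b in zip(u, v))

def midpoint(u, v):
    """The point in the middle between two points, as in equation (10)."""
    return tuple(Fraction(a + b, 2) for a, b in zip(u, v))

# Hypothetical plane: sigma has the reciprocals of the axis intercepts as components.
sigma = (Fraction(1, 2), Fraction(1, 3), Fraction(1, 6))

rho_1 = (2, 0, 0)                  # a point of the plane
rho_2 = (0, 3, 0)                  # another point of the plane
rho_3 = midpoint(rho_1, rho_2)     # lies in the plane as well
origin = (0, 0, 0)                 # does not lie in the plane

# Foot of the perpendicular from the origin: parallel to sigma, length 1/|sigma|.
foot = tuple(s / dot(sigma, sigma) for s in sigma)

for rho in (rho_1, rho_2, rho_3, foot, origin):
    print(rho, dot(rho, sigma) == 1)
```

The last two lines of data confirm the description of σ in the text: the foot of the perpendicular satisfies the incidence condition, and its length is the reciprocal of the length of σ.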

My object in going over these elementary matters is to call attention 
to the very roundabout way in which the ordinary analysis makes 
out to represent a point or a plane by a single letter, as distinguished 
from the directness and simplicity of the notations of multiple algebra, 
and also to the fact that the representations of points and planes by 
single letters in the ordinary analysis are not, when obtained, as 
amenable to analytical treatment as are the notations of multiple 
algebra. 

I have compared that form of the ordinary analysis which relates 
to Cartesian axes with a vector analysis. But the case is essentially 
the same if we compare the form of ordinary analysis which relates 
to a fundamental tetrahedron with Grassmann’s geometrical analysis, 
founded on the point as the elementary quantity. 

In the method of ordinary analysis a point is represented by four 
coordinates, of which each represents the distance of the point from 
a plane of the tetrahedron divided by the distance of the opposite 
vertex from the same plane. The equation of a plane may be put 
in the form 

ξx + ηy + ζz + ωw = 0, (14) 
where ξ, η, ζ, ω are the distances of the plane from the four points, 
and x, y, z, w are the coordinates of any point in the plane. Here 
we may set 

p = ξx + ηy + ζz + ωw, (15) 
and say that p represents the plane. To some extent we can introduce 
this letter into equations instead of ξ, η, ζ, ω. Thus the equation 


lp′ + mp″ + np‴ = 0 (16) 
(which denotes that the planes p′, p″, p‴ meet in a common line, 
making angles of which the sines are proportional to l, m, and n) is 
equivalent to the four equations 


lξ′ + mξ″ + nξ‴ = 0, lη′ + mη″ + nη‴ = 0, etc. (17) 


Again, we may regard ξ, η, ζ, ω as the coordinates of a plane. The 
equation of a point will then be 


xξ + yη + zζ + wω = 0. (18) 
If we set 
q = xξ + yη + zζ + wω, (19) 
we may say that q represents the point. The equation 

q‴ = (q′ + q″)/2, (20) 


which indicates that the point q‴ bisects the line between q′ and q″, 
is equivalent to the four equations 


x‴ = (x′ + x″)/2, y‴ = (y′ + y″)/2, etc. (21) 

To express that the point q lies in the plane p does not seem easy, 
without going back to the use of coordinates. 

The form of multiple algebra which is to be compared to this is the 
geometrical algebra of Möbius and Grassmann, in which points without 
reference to any origin are represented by single letters, say by Italic 
capitals, and planes may also be represented by single letters, say by 
Greek capitals. An equation like 


Q‴ = (Q′ + Q″)/2 (22) 


has exactly the same meaning as equation (20) of ordinary algebra. So 
lΠ′ + mΠ″ + nΠ‴ = 0 (23) 


has precisely the same meaning as equation (16) of ordinary algebra. 
That the point Q lies in the plane Π is expressed by equating to zero 
the product of Q and Π which is called by Grassmann external and 
which might be defined as the distance of the point from the plane. 
We may write this 

Q × Π = 0. (24) 
To show that so simple an expression is really amenable to analytical 
treatment, I observe that Q may be expressed in terms of any four 
points (not in the same plane) on the barycentric principle explained 
above, viz., 

Q = xA + yB + zC + wD, (25) 


and Π may be expressed in terms of combinatorial products of 
A, B, C, and D, viz., 


Π = ξ B×C×D + η C×A×D + ζ D×A×B + ω A×C×B, (26) 


and by these substitutions, by the laws of the combinatorial product 
to be mentioned hereafter, equation (24) is transformed into 


wω + xξ + yη + zζ = 0, (27) 
which is identical with the formula of ordinary analysis.* 
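
In modern terms the incidence condition (24) may be tested with a determinant. The sketch below rests on an assumption that is not stated in the text: that the combinatorial product of four points may be evaluated, up to a constant factor, as the 4 × 4 determinant of their rectangular coordinates bordered by a column of ones (six times the signed volume of the tetrahedron they form). The plane Π is taken as the product of three hypothetical points A, B, C; all names are illustrative.

```python
def det(matrix):
    """Determinant by expansion along the first row (adequate for a 4 x 4 array)."""
    if len(matrix) == 1:
        return matrix[0][0]
    total = 0
    for j, entry in enumerate(matrix[0]):
        minor = [row[:j] + row[j + 1:] for row in matrix[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def product_of_four_points(p, q, r, s):
    """Assumed evaluation of the product P x Q x R x S: rows (1, x, y, z)."""
    return det([[1, *p], [1, *q], [1, *r], [1, *s]])

# Hypothetical plane through A, B, C, i.e. the plane x + y + z = 1.
A, B, C = (1, 0, 0), (0, 1, 0), (0, 0, 1)

Q_in = (2, -1, 0)    # satisfies x + y + z = 1
Q_out = (1, 1, 1)    # does not

print(product_of_four_points(Q_in, A, B, C))    # vanishes: the point lies in the plane
print(product_of_four_points(Q_out, A, B, C))   # does not vanish
```

Interchanging two of the four points changes the sign of the determinant, in agreement with the rule for expressions such as ABCD given earlier.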

I have gone at length into this very simple point in order to 
illustrate the fact, which I think is a general one, that the modern 
geometry is not only tending to results which are appropriately 
expressed in multiple algebra, but that it is actually striving to clothe 
itself in forms which are remarkably similar to the notations of 
multiple algebra, only less simple and general and far less amenable 
to analytical treatment, and therefore, that a certain logical necessity 
calls for throwing off the yoke under which analytical geometry has 
so long labored. And lest this should seem to be the utterance of an 
uninformed enthusiasm, or the echoing of the possibly exaggerated 
claims of the devotees of a particular branch of mathematical study, 
I will quote a sentence from Clebsch and one from Clifford, relating 
to the past and to the future of multiple algebra. The former in 
his eulogy on Plücker,† in 1871, speaking of recent advances in 
geometry, says that “in a certain sense the coordinates of a straight 
line, and in general a great part of the fundamental conceptions of 
the newer algebra, are contained in the Ausdehnungslehre of 1844,” 
and Clifford,‡ in the last year of his life, speaking of the Ausdehnungslehre, 
with which he had but recently become acquainted, 
expresses “his profound admiration of that extraordinary work, and 
his conviction that its principles will exercise a vast influence upon 
the future of mathematical science.” 

Another subject in which we find a tendency toward the forms 
and methods of multiple algebra, is the calculus of operations. Our 
ordinary analysis introduces operators, and the successive operations 
A and B may be equivalent to the operation C. To express this in 
an equation we may write 


BA(x) = C(x), 


* The letters ξ, η, ζ, ω here denote the distances of the plane Π from the points 
A, B, C, D, divided by six times the volume of the tetrahedron A, B, C, D. The 
letters x, y, z, w denote the tetrahedral coordinates as above. 

† Gött. Abhandl., vol. xvi, p. 28. ‡ Amer. Journ. Math., vol. i, p. 350. 

where x is any quantity or function. We may also have occasion to 
write 

A(x) + B(x) = D(x), or (A + B)(x) = D(x). 
But it is almost impossible to resist the tendency to express these 
relations in the form 

BA = C, 
A + B = D, 


in which the operators appear in a sense as quantities, i.e., as subjects 
of functional operation. Now since these operators are often of such 
nature that they cannot be perfectly specified by a single numerical 
quantity, when we treat them as quantities they must be regarded as 
multiple quantities. In this way certain formule which essentially 
belong to multiple algebra get a precarious footing where they are 
only allowed because they are regarded as abridged notations for 
equations in ordinary algebra. Yet the logical development of such 
notations would lead a good way in multiple algebra, and doubtless 
many investigators have entered the field from this side. 
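
The passage from BA(x) = C(x) to BA = C can be imitated in a programming language, where an operator is itself a value that may be multiplied and added. The following is a minimal sketch, not drawn from the text: the class `Operator`, the two sample operators (differentiation and multiplication by the variable), and the representation of a polynomial by its list of coefficients are all assumptions made for illustration.

```python
def add_polys(p, q):
    """Add two polynomials given as coefficient lists [c0, c1, c2, ...]."""
    n = max(len(p), len(q))
    p = p + [0] * (n - len(p))
    q = q + [0] * (n - len(q))
    return [a + b for a, b in zip(p, q)]

class Operator:
    """A linear operator on polynomials, treated as a quantity in its own right."""

    def __init__(self, func):
        self.func = func

    def __call__(self, poly):
        return self.func(poly)

    def __mul__(self, other):
        # (B * A)(x) means: operate first by A, then by B.
        return Operator(lambda poly: self(other(poly)))

    def __add__(self, other):
        # (A + B)(x) means A(x) + B(x).
        return Operator(lambda poly: add_polys(self(poly), other(poly)))

# Two hypothetical operators on polynomials in t.
differentiate = Operator(lambda poly: [k * c for k, c in enumerate(poly)][1:] or [0])
times_t = Operator(lambda poly: [0] + poly)

C = times_t * differentiate      # BA = C
D = differentiate + times_t      # A + B = D

f = [1, 2, 3]                    # the polynomial 1 + 2t + 3t^2
print(C(f), D(f), (differentiate * times_t)(f))
```

That `times_t * differentiate` and `differentiate * times_t` give different results shows why such operators cannot be specified by a single numerical quantity.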

One might also notice, to show how the ordinary algebra is be- 
coming saturated with the notions and notations which seem destined 
to turn it into a multiple algebra, the notation so common in the 
higher algebra 

(a, b, c)(x, y, z) 
for ax + by + cz. 


This is evidently the same as Grassmann’s internal product of the 
multiple quantities (a, b, c) and (x, y, z), or, in the language of 
quaternions, the scalar part, taken negatively, of the product of the 
vectors of which a, b, c and x, y, z are the components. A similar 
correspondence with Grassmann’s methods might, I think, be shown 
in such notations as, for example, 


(a, b, c, d)(x, y)³. 


The free admission of such notations is doubtless due to the fact that 
they are regarded simply as abridged notations. 

The author of the celebrated “Memoir on the Theory of Matrices” 
goes much farther than this in his use of the forms of multiple 
algebra. Thus he writes explicitly one equation to stand for several, 
without the use of any of the analytical artifices which have been 
mentioned. This work has indeed, as we have seen, been characterized 
as marking the commencement of multiple algebra,—a view to which 
we can only take exception as not doing justice to earlier writers. 

But the significance of this memoir with regard to the point which 
I am now considering is that it shows that the chasm so marked 
in the second quarter of this century is destined to be closed up. 
Notions and notations for which a Cayley is sponsor will not be 
excluded from good society among mathematicians. And if we admit 
as suitable the notations used in this memoir (where it is noticeable 
that the author rather avoids multiple algebra, and only uses it very 
sparingly), we shall logically be brought to use a great deal more. 
For example, if it is a good thing to write in our equations a single 
letter to represent a matrix of n² numerical quantities, why not use 
a single letter to represent the n quantities operated upon, as Grass- 
mann and Hamilton have done? Logical consistency seems to de- 
mand it. And if we may use the sign )( to denote an operation by 
which two sets of quantities are combined to form a third set, as is 
the case in this memoir, why not use other signs to denote other 
functional operations of which the result is a multiple quantity ? 
If it be conceded that this is the proper method to follow where 
simplicity of conception, or brevity of expression, or ease of trans- 
formation is served thereby, our algebra will become in large part a 
multiple algebra. 


We have considered the subject a good while from the outside; we 
have glanced at the principal events in the history of multiple algebra; 
we have seen how the course of modern thought seems to demand its 
aid, how it is actually leaning toward it, and beginning to adopt its 
methods. It may be worth while to direct our attention more 
critically to multiple algebra itself, and inquire into its essential 
character and its most important principles. 

I do not know that anything useful or interesting, which relates 
to multiple quantity, and can be symbolically expressed, falls outside 
of the domain of multiple algebra. But if it is asked, what notions 
are to be regarded as fundamental, we must answer, here as else- 
where, those which are most simple and fruitful. Unquestionably, no 
relations are more so than those which are known by the names 
of addition and multiplication. 

Perhaps I should here notice the essentially different manner in 
which the multiplication of multiple quantities has been viewed 
by different writers. Some, as Hamilton, or De Morgan, or Peirce, 
speak of the product of two multiple quantities, as if only one product 
could exist, at least in the same algebra. Others, as Grassmann, speak 
of various kinds of products for the same multiple quantities. Thus 
Hamilton seems for many years to have agitated the question, what 
he should regard as the product of each pair of a set of triplets, or 
in the geometrical application of the subject, what he should regard 
as the product of each pair of a system of perpendicular directed 
lines.* Grassmann asks, What products, i.e., what distributive 
functions of the multiple quantities, are most important? 


* Phil. Mag. (3), vol. xxv, p. 490; North British Review, vol. xlv (1866), p. 57. 

It may be that in some cases the fact that only one kind of product 
is known in ordinary algebra has led those to whom the problem 
presented itself in the form of finding a new algebra to adopt this 
characteristic derived from the old. Perhaps the reason lies deeper 
in a distinction like that in arithmetic between concrete and abstract 
numbers or quantities. The multiple quantities corresponding to 
concrete quantities such as ten apples or three miles are evidently 
such combinations as ten apples + seven oranges, three miles north- 
ward + five miles eastward, or six miles in a direction fifty degrees 
east of north. Such are the fundamental multiple quantities from 
Grassmann’s point of view. But if we ask what it is in multiple 
algebra which corresponds to an abstract number like twelve, which 
is essentially an operator, which changes one mile into twelve miles, 
and $1,000 into $12,000, the most general answer would evidently 
be: an operator which will work such changes as, for example, that 
of ten apples + seven oranges into fifty apples + 100 oranges, or that 
of one vector into another. 

Now an operator has, of course, one characteristic relation, viz., its 
relation to the operand. This needs no especial definition, since it 
is contained in the definition of the operator. If the operation 
is distributive, it may not inappropriately be called multiplication, 
and the result is par excellence the product of the operator and 
operand. The sum of operators, quâ operators, is an operator which 
gives for the product the sum of the products given by the operators 
to be added. The product of two operators is an operator which is 
equivalent to the successive operations of the factors. This multi- 
plication is necessarily associative, and its definition is not really 
different from that of the operators themselves. And here I may 
observe that Professor C. S. Peirce has shown that his father’s 
associative algebras may be regarded as operational and matricular.* 
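
[Illustration. A minimal Python sketch of the sum and product of distributive operators, assuming that each operator is stored as a square table of numbers (a list of rows). That representation, the function names, and the particular diagonal operator for the apples and oranges are assumptions made here, not taken from the text.]

```python
from fractions import Fraction


def apply(op, x):
    """Product of an operator (a square table, as a list of rows) and an operand."""
    return [sum(a * b for a, b in zip(row, x)) for row in op]


def add(op1, op2):
    """Sum of operators: gives the sum of the products given by each operator."""
    return [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(op1, op2)]


def compose(op1, op2):
    """Product of operators: op2 acts first, then op1."""
    cols = list(zip(*op2))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in op1]


# The operand "ten apples + seven oranges" as a pair (apples, oranges).
basket = [10, 7]
# One operator (a diagonal choice among many) giving fifty apples + 100 oranges.
scale = [[5, 0], [0, Fraction(100, 7)]]
result = apply(scale, basket)

A = [[1, 2], [0, 1]]
B = [[0, 1], [3, 1]]
C = [[2, 0], [1, 1]]
x = [4, -1]
sum_applied = apply(add(A, B), x)
product_applied = apply(compose(A, B), x)
```

[With these definitions the associative character of the product of operators follows from the definition of successive operation, and can be confirmed by comparing `compose(compose(A, B), C)` with `compose(A, compose(B, C))`.]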

Now the calculus of distributive operators is a subject of great 
extent and importance, but Grassmann’s view is the more compre- 
hensive, since it embraces the other with something besides. For 
every quantitative operator may be regarded as a quantity, i.e., as 
the subject of mathematical operation, but every quantity cannot 
be regarded as an operator; precisely as in grammar every verb may 
be taken as substantive, as in the infinitive, while every substantive 
does not give us a verb. 

Grassmann’s view seems also the most practical and convenient. 
For we often use many functions of the same pair of multiple 
quantities, which are distributive with respect to both, and we need 
some simple designation to indicate a property of such fundamental 


* Amer. Journ. Math., vol. iv, p. 221. 

importance in the algebra of such functions, and no advantage 
appears in singling out a particular function to be alone called 
the product. Even in quaternions, where Hamilton speaks of 
only one product of two vectors (regarding it as a special case 
of the product of quaternions, i.e., of operators), he nevertheless 
comes to use the scalar part of this product and the vector part 
separately. Now the distributive law is satisfied by each of these, 
which therefore may conveniently be called products. In this 
sense we have three kinds of products of vectors in Hamilton’s 
analysis. 

Let us then adopt the more general view of multiplication, and call 
any function of two or more multiple quantities, which is distributive 
with respect to all, a product, with only this limitation, that when 
one of the factors is simply an ordinary algebraic quantity, its effect 
is to be taken in the ordinary sense. 
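
[Illustration. The definition can be tried on the two parts of Hamilton's product of vectors. The Python sketch below assumes the usual expressions for those parts (the negative of the sum of products of like components, and the cross product); the function names and sample vectors are hypothetical.]

```python
def scalar_part(u, v):
    """Scalar part of the quaternion product of two vectors: -(u . v)."""
    return -sum(p * q for p, q in zip(u, v))


def vector_part(u, v):
    """Vector part of the quaternion product of two vectors: the cross product."""
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )


def add(u, v):
    """Sum of two vectors given by their components."""
    return tuple(p + q for p, q in zip(u, v))


def is_distributive(f, a, b, c):
    """Check f(a + b, c) = f(a, c) + f(b, c), and the same in the second factor."""
    def plus(s, t):
        return add(s, t) if isinstance(s, tuple) else s + t

    left = f(add(a, b), c) == plus(f(a, c), f(b, c))
    right = f(c, add(a, b)) == plus(f(c, a), f(c, b))
    return left and right


a, b, c = (1, 2, 3), (-2, 0, 5), (4, -1, 2)
scalar_ok = is_distributive(scalar_part, a, b, c)
vector_ok = is_distributive(vector_part, a, b, c)
```

[Both checks succeed for these sample values, so that each part is a product in the sense of the definition; a check on sample values is of course an illustration and not a proof.]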

It is to be observed that this definition of multiplication implies 
that we have an addition both of the kind of quantity to which the 
product belongs, and of the kinds to which the factors belong. Of 
course, these must be subject to the general formal laws of addition. 
I do not know that it is necessary for the purposes of a general 
discussion to stop to define these operations more particularly, either 
on their own account or to complete the definition of multiplication. 
Algebra, as a formal science, may rest on a purely formal foundation. 
To take our illustration again from mechanics, we may say that 
if a man is inventing a particular machine,—a sewing machine, a 
reaper,—nothing is more important than that he should have a precise 
idea of the operation which his machine is to perform, yet when he is 
treating the general principles of mechanics he may discuss the lever, 
or the form of the teeth of wheels which will transmit uniform 
motion, without inquiring the purpose to which the apparatus is to 
be applied; and in like manner that if we were forming a particular 
algebra,—a geometrical algebra, a mechanical algebra, an algebra 
for the theory of elimination and substitution, an algebra for the 
study of quantics,—we should commence by asking, What are the 
multiple quantities, or sets of quantities, which we have to consider ? 
What are the additive relations between them? What are the multi- 
plicative relations between them? etc. forming a perfectly defined 
and complete idea of these relations as we go along; but in the 
development of a general algebra no such definiteness of conception 

is requisite. Given only the purely formal law of the distributive 
character of multiplication,—this is sufficient for the foundation of a 
science. Nor will such a science be merely a pastime for an ingenious 
mind. It will serve a thousand purposes in the formation of parti- 
cular algebras. Perhaps we shall find that in the most important 
cases the particular algebra is little more than an application or 
interpretation of the general. 

Grassmann observes that any kind of multiplication of n-fold 
quantities is characterized by the relations which hold between the 
products of n independent units. In certain kinds of multiplication 
these characteristic relations will hold true of the products of any 
of the quantities. 

Thus if the value of a product is independent of the order of the 
factors when these belong to the system of units, it will always be 
independent of the order of the factors. The kind of multiplication 
characterized by this relation and no other between the products is 
called by Grassmann algebraic, because its rules coincide with those 
of ordinary algebra. It is to be observed, however, that it gives 
rise to multiple quantities of higher orders. If n independent units 
are required to express the original quantities, n(n+1)/2 units will be 
required for the products of two factors, n(n+1)(n+2)/(2·3) for the 
products of three factors, etc. 

Again, if the value of a product of factors belonging to a system 
of units is multiplied by —1 when two factors change places, the 
same will be true of the product of any factors obtained by addition 
of the units. The kind of multiplication characterized by this relation 
and no other is called by Grassmann external or combinatorial. For 
our present purpose we may denote it by the sign ×. It gives rise 
to multiple quantities of higher orders, n(n−1)/2 units being required 
to express the products of two factors, n(n−1)(n−2)/(2·3) units for 
products of three factors, etc. All products of more than n factors 
are zero. The products of n factors may be expressed by a single 
unit, viz., the product of the n original units taken in a specified 
order, which is generally set equal to 1. The products of n−1 factors 
are expressed in terms of n units, those of n−2 factors in terms of 
n(n−1)/2 units, etc. This kind of multiplication is associative, like the 
algebraic. 
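
[Illustration. The numbers of units stated for the algebraic and the combinatorial multiplication can be recovered by direct enumeration. In the Python sketch below the function names and the choice n = 4 are assumptions made for the example.]

```python
from itertools import combinations, combinations_with_replacement


def algebraic_units(n, k):
    """Independent products of k units when the order of factors is indifferent."""
    return sum(1 for _ in combinations_with_replacement(range(n), k))


def combinatorial_units(n, k):
    """Independent products of k units when interchanging two factors changes the sign.

    A product with a repeated unit equals its own negative and so vanishes,
    which leaves one product for each set of k distinct units.
    """
    return sum(1 for _ in combinations(range(n), k))


n = 4
algebraic_counts = [algebraic_units(n, k) for k in (2, 3)]
combinatorial_counts = [combinatorial_units(n, k) for k in range(1, n + 2)]
```

[The last entry of `combinatorial_counts` corresponds to products of more than n factors, and the entry before it to the single unit which expresses the products of n factors.]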

Grassmann observes, with respect to binary products, that these 
two kinds of multiplication are the only kinds characterized by laws 
which are the same for any factors as for particular units, except 
indeed that characterized by no special laws, and that for which all 
products are zero.* The last we may evidently reject as nugatory. 
That for which there are no special laws, i.e., in which no equations 


* Crelle’s Journ. f. Math., vol. xlix, p. 138. 

subsist between the products of a system of independent units, is also 
rejected by Grassmann, as not appearing to afford important applica- 
tions. I shall, however, have occasion to speak of it, and shall call 
it the indeterminate product. In this kind of multiplication, n² units 
are required to express the products of two factors, and n³ units for 
products of three factors, etc. It evidently may be regarded as 
associative. 

Another very important kind of multiplication is that called by 
Grassmann internal. In the form in which I shall give it, which is 
less general than Grassmann’s, it is in one respect the most simple of 
all, since its only result is a numerical quantity. It is essentially 
binary and characterized by laws of the form 

i.i=1,   j.j=1,   k.k=1,   etc., 

i.j=0,   j.i=0,   etc., 
where i, j, k, etc., represent a system of independent units. I use the 
dot as significant of this kind of multiplication. 

Grassmann derives this kind of multiplication from the com- 
binatorial by the following process. He defines the complement 
(Ergänzung) of a unit as the combinatorial product of all the other 
units, taken with such a sign that the combinatorial product of the 
unit and its complement shall be positive. The combinatorial product 
of a unit and its complement is therefore unity, and that of a unit 
and the complement of any other unit is zero. The internal product 
of two units is the combinatorial product of the first and the com- 
plement of the second. 

It is important to observe that any scalar product of two factors of 
the same kind of multiple quantities, which is positive when the 
factors are identical, may be regarded as an internal product, i.e., we 
may always find such a system of units, that the characteristic 
equations of the product will reduce to the above form. The nature 
of the subject may afford a definition of the product independent of 
any reference to a system of units. Such a definition will then have 
obvious advantages. An important case of this kind occurs in 
geometry in that product of two vectors which is obtained by multi- 
plying the products of their lengths by the cosine of the angle which 
they include. This is an internal product in Grassmann’s sense. 
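
[Illustration. A short Python sketch comparing the internal product as fixed by the laws for the units with the geometrical definition by lengths and included angle. The function names, the two sample vectors and their angle of 45 degrees are assumptions made for the example.]

```python
import math


def internal(u, v):
    """Internal product fixed by i.i = j.j = k.k = 1 and i.j = j.k = k.i = 0.

    Distributivity extends these laws for the units to any two quantities
    given by their components on i, j, k.
    """
    n = len(u)
    unit_rule = [[1 if r == s else 0 for s in range(n)] for r in range(n)]
    return sum(u[r] * v[s] * unit_rule[r][s] for r in range(n) for s in range(n))


def length(u):
    """Length of a vector, from the internal product of the vector with itself."""
    return math.sqrt(internal(u, u))


u = (3.0, 0.0, 0.0)
v = (2.0, 2.0, 0.0)   # makes an angle of 45 degrees with u
by_units = internal(u, v)
by_geometry = length(u) * length(v) * math.cos(math.radians(45))
```

[The two values agree to within rounding, the second being computed without reference to the system of units.]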

Let us now return to the indeterminate product, which I am 
inclined to regard as the most important of all, since we may derive 
from it the algebraic and the combinatorial. For this end we will 
prefix Σ to an indeterminate product to denote the sum of all the 
terms obtained by taking the factors in every possible order. Then, 


Σ α|β|γ, 
for instance, where the vertical line is used to denote the 
indeterminate product,* is a distributive function of α, β, and γ. It 
is evidently not affected by changing the order of the letters. It is, 
therefore, an algebraic product in the sense in which the term has 
been defined. 

So, again, if we prefix Σ± to an indeterminate product to denote 
the sum of all terms obtained by giving the factors every possible 
order, those terms being taken negatively which are obtained by an 
odd number of simple permutations, 


Σ± α|β|γ, 


for instance, will be a distributive function of α, β, γ, which is 
multiplied by —1 when two of these letters change places. It will 
therefore be a combinatorial product. 
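
[Illustration. The derivation of the algebraic and combinatorial products from the indeterminate may be sketched in Python as follows. It is assumed here, for the purpose of the example only, that an indeterminate product of k quantities of n components is stored as a table of coefficients on all ordered products of k units; the names `indeterminate`, `sign` and `summed` are hypothetical.]

```python
from itertools import permutations, product


def indeterminate(*factors):
    """Indeterminate product a|b|c... as a table of coefficients on products of units.

    No relations are assumed between products of units, so the product of k
    quantities of n components is stored on all n**k ordered index tuples.
    """
    n = len(factors[0])
    table = {}
    for index in product(range(n), repeat=len(factors)):
        coeff = 1
        for factor, i in zip(factors, index):
            coeff *= factor[i]
        table[index] = coeff
    return table


def sign(perm):
    """+1 for an even permutation of 0..k-1, -1 for an odd one."""
    k = len(perm)
    inversions = sum(
        1 for i in range(k) for j in range(i + 1, k) if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def summed(factors, signed=False):
    """Sum of the indeterminate products of the factors taken in every order.

    With signed=True, terms reached by an odd permutation are taken negatively.
    """
    total = {}
    for perm in permutations(range(len(factors))):
        term = indeterminate(*(factors[p] for p in perm))
        weight = sign(perm) if signed else 1
        for index, coeff in term.items():
            total[index] = total.get(index, 0) + weight * coeff
    return total


a, b, c = (1, 2, 0), (0, 1, 3), (2, -1, 1)
algebraic = summed([a, b, c])
combinatorial = summed([a, b, c], signed=True)
swapped = summed([b, a, c], signed=True)
```

[The table `algebraic` is unchanged when the order of the factors is changed, while every entry of `swapped` is the negative of the corresponding entry of `combinatorial`.]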

It is a characteristic and very important property of an indeter- 
minate product that every product of all its factors with any other 
quantities is also a product of the indeterminate product and the 
other quantities. We need not stop for a formal proof of this pro- 
position, which indeed is an immediate consequence of the definitions 
of the terms. 

These considerations bring us naturally to what Grassmann calls 
regressive multiplication, which I will first illustrate by a very 
simple example. If n, the degree of multiplicity of our original 
quantities, is 4, the combinatorial product of α×β×γ and δ×ε, viz., 

α×β×γ×δ×ε, 
is necessarily zero, since the number of factors exceeds four. But if 
for δ×ε we set its equivalent 
δ|ε − ε|δ, 
we may multiply the first factor in each of these indeterminate pro- 
ducts combinatorially by α×β×γ, and prefix the result, which is a 
numerical quantity, as coefficient to the second factor. This will give 
(α×β×γ×δ)ε − (α×β×γ×ε)δ. 

Now, the first term of this expression is a product of α×β×γ, δ, and 
ε, and therefore, by the principle just stated, a product of α×β×γ 
and δ|ε. The second term is a similar product of α×β×γ and ε|δ. 
Therefore the whole expression is a product of α×β×γ and δ|ε − ε|δ, 
that is, of α×β×γ and δ×ε. That is, except in sign, what Grass- 
mann calls the regressive product of α×β×γ and δ×ε. 
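
[Illustration. The example for n = 4 may be computed directly. In the Python sketch below a combinatorial product of four quantities of four components is evaluated as a determinant, the product of the four units in their given order being set equal to 1; the function names and the sample quantities are assumptions made for the example.]

```python
from itertools import permutations


def det(vectors):
    """Combinatorial product of n quantities of n components, as a number.

    The product of the n units in their given order is set equal to 1,
    so the value is the determinant of the components.
    """
    n = len(vectors)
    total = 0
    for perm in permutations(range(n)):
        inversions = sum(
            1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j]
        )
        term = -1 if inversions % 2 else 1
        for row, col in enumerate(perm):
            term *= vectors[row][col]
        total += term
    return total


def regressive_3_2(a, b, c, d, e):
    """(a x b x c x d) e - (a x b x c x e) d for n = 4, as in the text."""
    coeff_e = det([a, b, c, d])
    coeff_d = det([a, b, c, e])
    return tuple(coeff_e * y - coeff_d * x for x, y in zip(d, e))


alpha, beta, gamma = (1, 0, 0, 2), (0, 1, 0, -1), (0, 0, 1, 3)
delta, epsilon = (1, 1, 0, 0), (0, 2, 1, 2)
result = regressive_3_2(alpha, beta, gamma, delta, epsilon)
check = det([alpha, beta, gamma, result])
```

[The result is a quantity of the first degree formed from δ and ε, and its combinatorial product with α×β×γ, here `check`, vanishes; it is thus common to the two factors, and of less degree than either.]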


To generalize this process, we first observe that an expression of 
the form 


Σ± α×β|γ×δ, 


in which each term is an indeterminate product of two combinatorial 
products, and in which Σ± denotes the sum of all terms obtained by 


* This notation must not be confounded with Grassmann’s use of the vertical line. 

putting every different pair of the letters before the dividing line, the 
negative sign being used for any terms which may be obtained by an 
odd number of simple permutations of the letters,—in other words 
the expression 


α×β|γ×δ − α×γ|β×δ − α×δ|γ×β + β×γ|α×δ 
− β×δ|α×γ + γ×δ|α×β, 
is a distributive function of α, β, γ, and δ, which is multiplied by −1 
when two of these letters change places, and may, therefore, be regarded 
as equivalent to the combinatorial product α×β×γ×δ. Now, if n=5, 
the combinatorial product of 

ρ×σ×τ and α×β×γ×δ 

is zero. But if we multiply the first member of each of the above 
indeterminate products by ρ×σ×τ, and prefix the result as co- 
efficient to the second member, we obtain 

(ρ×σ×τ×α×β)γ×δ − (ρ×σ×τ×α×γ)β×δ + etc., 

which is what Grassmann calls the regressive product of ρ×σ×τ and 
α×β×γ×δ. It is easy to see that the principle may be extended so 
as to give a regressive product in any case in which the total number 
of factors of two combinatorial products is greater than n. Also, that 
we might form a regressive product by treating the first of the given 
combinatorials as we have treated the second. It may easily be shown 
that this would give the same result, except in some cases with a 
difference of sign. To avoid this inconvenience, we may make the 
rule, that whenever in the substitution of a sum of indeterminate 
products for a combinatorial, both factors of the indeterminate products 
are of odd degree, we change the sign of the whole expression. With 
this understanding, the results which we obtain will be identical with 
Grassmann’s regressive product. The propriety of the name consists 
in the fact that the product is of less degree than either of the factors. 
For the contrary reason, the ordinary external or combinatorial 
multiplication is sometimes called by Grassmann progressive. 

Regressive multiplication is associative and exhibits a very remark- 
able analogy with the progressive. This analogy I have not time 
here to develop, but will only remark that in this analogy lies in its 
most general form that celebrated principle of duality, which appears 
in various forms in geometry and certain branches of analysis. 

To fix our ideas, I may observe that in geometry the progressive 
multiplication of points gives successively lines, planes and volumes ; 
the regressive multiplication of planes gives successively lines, points 
and scalar quantities. 

The indeterminate product affords a natural key to the subject of 
matrices. In fact, a sum of indeterminate products of the second 
degree represents n² scalars, which constitute an ordinary or quadratic 
matrix; a sum of indeterminate products of the third degree repre- 
sents n³ scalars, which constitute a cubic matrix, etc. I shall confine 
myself to the simplest and most important case, that of quadratic 
matrices. 

An expression of the form 
α(λ.ρ) 
being a product of α, λ, and ρ, may be regarded as a product of α|λ 
and ρ, by a principle already stated. Now if Φ denotes a sum of 
indeterminate products, of second degree, say α|λ+β|μ+etc., we 
may write 
Φ.ρ 
for α(λ.ρ)+β(μ.ρ)+etc. 


This is, like ρ, a quantity of the first degree, and it is a homogeneous 
linear function of ρ. It is easy to see that the most general form 
of such a function may be expressed in this way. An equation like 

σ=Φ.ρ 
represents n equations in ordinary algebra, in which n variables are 
expressed as linear functions of n others by means of n² coefficients. 

The internal product of two indeterminate products may be defined 
by the equation 
(α|β).(γ|δ)=(β.γ)α|δ. 
This defines the internal product of matrices, as 
Ψ.Φ. 

This product evidently gives a matrix, the operation of which is 
equivalent to the successive operations of Φ and Ψ; i.e., 
(Ψ.Φ).ρ=Ψ.(Φ.ρ). 

We may express this a little more generally by saying that internal 
multiplication is associative when performed on a series of matrices, 
or on such a series terminated by a quantity of the first degree. 
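
[Illustration. A Python sketch of quadratic matrices as sums of binary indeterminate products. It is assumed here that a|lam is stored as the table of products of the components of a and lam, so that a sum of such products is an ordinary square table; the function names and sample quantities are hypothetical.]

```python
def dyad(a, lam):
    """The indeterminate product a|lam stored as the table of products a[r] * lam[s]."""
    return [[x * y for y in lam] for x in a]


def add(m1, m2):
    """Sum of two tables of the same size."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]


def dot(u, v):
    """Internal product of two quantities of the first degree."""
    return sum(x * y for x, y in zip(u, v))


def act(phi, rho):
    """Phi.rho: each term a|lam of Phi contributes a (lam.rho)."""
    return [dot(row, rho) for row in phi]


def matrix_dot(psi, phi):
    """Internal product of matrices, following (a|b).(c|d) = (b.c) a|d."""
    cols = list(zip(*phi))
    return [[dot(row, col) for col in cols] for row in psi]


alpha, lam = (1, 2, 0), (3, 0, 1)
beta, mu = (0, 1, 1), (2, -1, 0)
rho = (1, 4, -2)

phi = add(dyad(alpha, lam), dyad(beta, mu))   # alpha|lam + beta|mu
direct = [a * dot(lam, rho) + b * dot(mu, rho) for a, b in zip(alpha, beta)]
psi = dyad((1, 0, 2), (0, 1, 1))
two_dyads = matrix_dot(dyad(alpha, lam), dyad(beta, mu))
```

[Here `act(phi, rho)` agrees with `direct`, `two_dyads` is (lam.beta) times alpha|mu, and `act(matrix_dot(psi, phi), rho)` agrees with `act(psi, act(phi, rho))`, which is the associative property stated above.]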

Another kind of multiplication of binary indeterminate products 
is that in which the preceding factors are multiplied combinatorially, 
and also the following. It may be defined by the equation 

(α|λ)×(β|μ)×(γ|ν) = α×β×γ|λ×μ×ν. 
This defines a multiplication of matrices denoted by the same symbol, 
as Φ×Ψ×Ω. 
This multiplication, which is associative and commutative, is of great 
importance in the theory of determinants. In fact, 

(1/n!) Φ^n [the power being taken in the sense of the multiplication ×] 

is the determinant of the matrix Φ. A lower power, as the mth, with 
the divisor n(n−1)...(n−m+1) would express as multiple quantity 
all the subdeterminants of order m.* 
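
[Illustration. The statement concerning the determinant can be tried numerically. The Python sketch below assumes that Φ is given as a sum of terms alphas[k]|lams[k], that a combinatorial product of n quantities of n components is evaluated as a determinant (the product of the units being set equal to 1), and that the nth power in the sense of × is divided by n!. The function names and sample values are hypothetical, and only the case of the full determinant is treated.]

```python
from itertools import permutations
from math import factorial


def det(vectors):
    """Determinant of a square table of components, by summing over permutations."""
    n = len(vectors)
    total = 0
    for perm in permutations(range(n)):
        inversions = sum(
            1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j]
        )
        term = -1 if inversions % 2 else 1
        for row, col in enumerate(perm):
            term *= vectors[row][col]
        total += term
    return total


def combinatorial_power(alphas, lams):
    """n-th power of Phi = sum of alphas[k]|lams[k] under the multiplication x.

    Each term (a1|l1) x ... x (an|ln) = a1 x ... x an | l1 x ... x ln reduces to
    a product of two numbers, since a combinatorial product of n quantities
    of n components is a multiple of the unit taken as 1.
    """
    n = len(alphas[0])
    total = 0
    for choice in permutations(range(len(alphas)), n):
        # Terms with a repeated factor vanish, so only distinct choices are listed.
        total += det([alphas[k] for k in choice]) * det([lams[k] for k in choice])
    return total


alphas = [(1, 2, 0), (0, 1, 1), (2, 0, 1), (1, 1, 1)]
lams = [(3, 0, 1), (2, -1, 0), (0, 1, 1), (1, 0, 2)]
n = 3
# Ordinary table of Phi: entry (r, s) is the sum of alphas[k][r] * lams[k][s].
matrix = [
    [sum(a[r] * lam[s] for a, lam in zip(alphas, lams)) for s in range(n)]
    for r in range(n)
]
power = combinatorial_power(alphas, lams)
determinant = power // factorial(n)
```

[For these sample values `determinant` coincides with `det(matrix)`, the determinant of the ordinary table of Φ.]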

It is evident that by the combination of the operations of inde- 
terminate, algebraic, and combinatorial multiplication we obtain 
multiple quantities of a more complicated nature than by the use 
of only one of these kinds of multiplication. The indeterminate 
product of combinatorial products we have already mentioned. The 
combinatorial product of algebraic products, and the indeterminate 
product of algebraic products, are also of great importance, especially 
in the theory of quantics. These three multiplications, with the 
internal, especially in connection with the general property of the 
indeterminate product given above, and the derivation of the algebraic 
and combinatorial products from the indeterminate, which affords a 
generalization of that property, give rise to a great wealth of multi- 
plicative relations between these multiple quantities. I say “wealth 
of multiplicative relations” designedly, for there is hardly any kind 
of relations between things which are the objects of mathematical 
study, which add so much to the resources of the student as those 
which we call multiplicative, except perhaps the simpler class which 
we call additive, and which are presupposed in the multiplicative. 
This is a truth quite independent of our using any of the notations 
of multiple algebra, although a suitable notation for such relations 
will of course increase their value. 


Perhaps, before closing, I ought to say a few words on the appli- 
cations of multiple algebra. 

First of all, geometry, and the geometrical sciences which treat of 
things having position in space, kinematics, mechanics, astronomy, 
physics, crystallography, seem to demand a method of this kind, for 
position in space is essentially a multiple quantity and can only be 
represented by simple quantities in an arbitrary and cumbersome 
manner. For this reason, and because our spatial intuitions are more 
developed than those of any other class of mathematical relations, 
these subjects are especially adapted to introduce the student to the 
methods of multiple algebra. Here, Nature herself takes us by the 
hand and leads us along by easy steps, as a mother teaches her child 
to walk. In the contemplation of such subjects, Möbius, Hamilton, 


* Quadratic matrices may also be represented by a sum of indeterminate products of 
a quantity of the first degree with a combinatorial product of (n—1)st degree, as, for 
example, when n=4, by a sum of products of the form 

α|β×γ×δ. 

The theory of such matrices is almost identical with that of those of the other form, 
except that the external multiplication takes the place of the internal, in the multipli- 
cation of the matrices with each other and with quantities of the first degree. 

and Grassmann formed their algebras, although the philosophical 
mind of the last was not satisfied until he had produced a system 
unfettered by any spatial relations. It is probably in connection with 
some of these subjects that the notions of multiple algebra are most 
widely disseminated. 

Maxwell’s Treatise on Electricity and Magnetism has done so 
much to familiarize students of physics with quaternion notations, 
that it seems impossible that this subject should ever again be entirely 
divorced from the methods of multiple algebra. 

I wish that I could say as much of astronomy. It is, I think, to be 
regretted, that the oldest of the scientific applications of mathematics, 
the most dignified, the most conservative, should keep so far aloof 
from the youngest of mathematical methods; and standing as I do 
to-day, by some chance, among astronomers, although not of the guild, 
I cannot but endeavor to improve the opportunity by expressing my 
conviction of the advantages which astronomers might gain by 
employing some of the methods of multiple algebra. A very few of 
the fundamental notions of a vector analysis, the addition of vectors 
and what quaternionists would call the scalar part and the vector 
part of the product of two vectors (which may be defined without 
the notion of the quaternion),—these three notions with some four 
fundamental properties relating to them are sufficient to reduce 
enormously the labor of mastering such subjects as the elementary 
theory of orbits, the determination of an orbit from three observations, 
the differential equations which are used in determining the best orbit 
from an indefinite number of observations by the method of least 
squares, or those which give the perturbations when the elements are 
treated as variable. In all these subjects the analytical work is 
greatly simplified, and it is far easier to find the best form for 
numerical calculation than by the use of the ordinary analysis. 

I may here remark that in its geometrical applications multiple 
algebra will naturally take one of two principal forms, according as 
vectors or points are taken as elementary quantities, i.e., according as 
something having magnitude and direction, or something having 
magnitude and position at a point, is the fundamental conception. 
These forms of multiple algebra may be distinguished as vector 
analysis and point analysis. The former we may call a triple, the 
latter a quadruple algebra, if we determine the degree of the algebra 
from the degree of multiplicity of the fundamental conception. The 
former is included in the latter, since the subtraction of points gives 
us vectors, and in this way Grassmann’s vector analysis is included in 
his point analysis. Hamilton’s system, in which the vector is the 
fundamental idea, is nevertheless made a quadruple algebra by the 
addition of ordinary numerical quantities. For practical purposes we 
may regard Hamilton’s system as equivalent to Grassmann’s algebra 
of vectors. Such practical equivalence is of course consistent with 
great differences of notation, and of the point of view from which the 
subject is regarded. 

Perhaps I should add a word in regard to the nature of the problems 
which require a vector analysis, or the more general form of Grass- 
mann’s point analysis. The distinction of the problems is very marked, 
and corresponds precisely to the distinction familiar to all analysts 
between problems which are suitable for Cartesian coordinates, and 
those which are suitable for the use of tetrahedral, or, in plane 
geometry, triangular coordinates. Thus, in mechanics, kinematics, 
astronomy, physics, or crystallography, Grassmann’s point analysis 
will rarely be wanted. One might teach these subjects for years by a 
vector analysis, and never perhaps feel the need of any of the notions 
or notations which are peculiar to the point analysis, precisely as in 
ordinary algebra one might use the Cartesian coordinates in teaching 
these subjects, without any occasion for the use of tetrahedral coor- 
dinates. I think of one exception, which, however, confirms the 
rule. The very important theory of forces acting on a rigid body is 
much better treated by point analysis than by vector analysis, exactly 
as in ordinary algebra it is much better treated by tetrahedral coor- 
dinates than by Cartesian,—I mean for the purpose of the elegant 
development of general propositions. A sufficient theory for the 
purposes of numerical calculations can easily enough be given by any 
method, and the most familiar to the student is for such practical 
purposes of course the best. On the other hand, the projective pro- 
perties of bodies, the relations of collinearity, and similar subjects, 
seem to demand the point analysis for their adequate treatment. 

If I have said that the algebra of vectors is contained in the algebra 
of points, it does not follow that in a certain sense the algebra of 
points is not deducible from the algebra of vectors. In mathematics, 
a part often contains the whole. If we represent points by vectors 
drawn from a common origin, and then develop those relations 
between such vectors representing points, which are independent of 
the position of the origin,—by this simple process we may obtain a 
large part, possibly all, of an algebra of points. In this way the 
vector analysis may be made to serve very conveniently for many of 
those subjects which I have mentioned as suitable for point analysis. 
The vector analysis, thus enlarged, is hardly to be distinguished from 
a point analysis, but the treatment of the subject in this way has 
somewhat of a makeshift character, as distinguished from the unity 
and simplicity of the subject when developed directly from the idea of 
something situated at a point. 

Of those subjects which have no relations to space, the elementary 
theory of eliminations and substitutions, including the theory of 
matrices and determinants, seems to afford the most simple application 
of multiple algebra. I have already indicated what seems to me the 
appropriate foundation for the theory of matrices. The method is 
essentially that which Grassmann has sketched in his first Ausdehn- 
ungslehre under the name of the open product and has developed 
at length in the second. 

In the theory of quantics Grassmann’s algebraic product finds an 
application, the quantic appearing as a sum of algebraic products in 
Grassmann’s sense of the term. As it has been stated that these 
products are subject to the same laws as the ordinary products of 
algebra, it may seem that we have here a distinction without an 
important difference. If the quantics were to be subject to no farther 
multiplications, except the algebraic in Grassmann’s sense, such an 
objection would be valid. But quantics regarded as sums of algebraic 
products, in Grassmann’s sense, are multiple quantities and subject to 
a great variety of other multiplications than the algebraic, by which 
they were formed. Of these the most important are doubtless the 
combinatorial, the internal, and the indeterminate. The combinatorial 
and the internal may be applied, not only to the quantic as a whole or 
to the algebraic products of which it consists, but also to the indi- 
vidual factors in each term, in accordance with the general principle 
which has been stated with respect to the indeterminate product and 
which will apply also to the algebraic, since the algebraic may be 
regarded as a sum of indeterminate products. 

In the differential and integral calculus it is often advantageous 
to regard as multiple quantities various sets of variables, especially 
the independent variables, or those which may be taken as such. 
It is often convenient to represent in the form of a single differential 
coefficient, as 

$$\frac{d\tau}{d\rho},$$

a block or matrix of ordinary differential coefficients. In this 
expression, $\rho$ may be a multiple quantity representing say $n$ 
independent variables, and $\tau$ another representing perhaps the same 
number of dependent variables. Then $d\rho$ represents the $n$ 
differentials of the former, and $d\tau$ the $n$ differentials of the latter. 
The whole expression represents an operator which turns $d\rho$ into $d\tau$, 
so that we may write identically 

$$d\tau = \frac{d\tau}{d\rho}\, d\rho.$$

Here we see a matrix of $n^2$ differential coefficients represented by 
a quotient. This conception is due to Grassmann, as well as the 
representation of the matrix by a sum of products, which we have 
already considered. It is to be observed that these multiple 
differential coefficients are subject to algebraic laws very similar to those 
which relate to ordinary differential coefficients when there is a 
single independent variable, e.g., 

$$\frac{d\sigma}{d\tau}\,\frac{d\tau}{d\rho} = \frac{d\sigma}{d\rho}, \qquad \frac{d\rho}{d\tau}\,\frac{d\tau}{d\rho} = 1.$$
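The first of these laws may be verified numerically. The following is a minimal sketch in Python, not part of the original address: the maps `rho_to_tau` and `tau_to_sigma` are hypothetical examples chosen only for illustration, and each block of differential coefficients is approximated by central differences in the assumed helper `jacobian`.

```python
def jacobian(f, x, h=1e-6):
    """Approximate the matrix of differential coefficients df/dx by central differences."""
    columns = []
    for j in range(len(x)):
        plus, minus = list(x), list(x)
        plus[j] += h
        minus[j] -= h
        fp, fm = f(plus), f(minus)
        columns.append([(a - b) / (2.0 * h) for a, b in zip(fp, fm)])
    # columns[j][i] holds d f_i / d x_j; transpose so that rows index f_i
    return [[columns[j][i] for j in range(len(x))] for i in range(len(columns[0]))]


def matmul(a, b):
    """Ordinary product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def rho_to_tau(p):            # hypothetical dependence of tau on rho
    x, y = p
    return [x * x - y, x + 2.0 * y]


def tau_to_sigma(t):          # hypothetical dependence of sigma on tau
    u, v = t
    return [u * v, u - 3.0 * v]


def rho_to_sigma(p):          # sigma regarded directly as a function of rho
    return tau_to_sigma(rho_to_tau(p))


rho = [0.7, -0.4]
tau = rho_to_tau(rho)
# (d sigma / d tau)(d tau / d rho) compared with d sigma / d rho
lhs = matmul(jacobian(tau_to_sigma, tau), jacobian(rho_to_tau, rho))
rhs = jacobian(rho_to_sigma, rho)
```

The two matrices `lhs` and `rhs` agree to within the error of the difference quotients, which is the matrix form of the quotient law stated above.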


In the integral calculus, the transformation of multiple integrals 
by change of variables is made very simple and clear by the methods 
of multiple algebra. 

In the geometrical applications of the calculus, there is a certain 
class of theorems, of which Green’s and Poisson’s are the most 
notable examples, which seem to have been first noticed in connection 
with certain physical theories, especially those of electricity and 
magnetism, and which have only recently begun to find their way 
into treatises on the calculus. These not only find simplicity of 
expression and demonstration in the infinitesimal calculus of multiple 
quantities, but also their natural position, which they hardly seem 
to find in the ordinary treatises. 

But I do not so much desire to call your attention to the diversity 
of the applications of multiple algebra, as to the simplicity and 
unity of its principles. The student of multiple algebra suddenly 
finds himself freed from various restrictions to which he has been 
accustomed. To many, doubtless, this liberty seems like an invi- 
tation to license. Here is a boundless field in which caprice may 
riot. It is not strange if some look with distrust for the result 
of such an experiment. But the farther we advance, the more 
evident it becomes that this too is a realm subject to law. The 
more we study the subject, the more we find all that is most useful 
and beautiful attaching itself to a few central principles. We 
begin by studying multiple algebras; we end, I think, by studying 
MULTIPLE ALGEBRA. 


ON THE DETERMINATION OF ELLIPTIC ORBITS FROM 
THREE COMPLETE OBSERVATIONS. 


[Memoirs of the National Academy of Sciences, vol. IV. part I. 
pp. 79-104, 1889.] 


THE determination of an orbit from three complete observations 
by the solution of the equations which represent elliptic motion 
presents so great difficulties in the general case, that in the first 
solution of the problem we must generally limit ourselves to the case 
in which the intervals between the observations are not very long. In 
this case we substitute some comparatively simple relations between 
the unknown quantities of the problem, which have an approximate 
validity for short intervals, for the less manageable relations which 
rigorously subsist between these quantities. A comparison of the 
approximate solution thus obtained with the exact laws of elliptic 
motion will always afford the means of a closer approximation, and 
by a repetition of this process we may arrive at any required degree 
of accuracy. 

It is therefore a problem not without interest—it is, in fact, the 
natural point of departure in the study of the determination of orbits 
—to express in a manner combining as far as possible simplicity and 
accuracy the relations between three positions in an orbit separated 
by small or moderate intervals. The problem is not entirely deter- 
minate, for we may lay the greater stress upon simplicity or upon 
accuracy; we may seek the most simple relations which are sufficiently 
accurate to give us any approximation to an orbit, or we may seek 
the most exact expression of the real relations, which shall not be too 
complex to be serviceable. 


Derivation of the Fundamental Equation. 


The following very simple considerations afford a vector equation, 
not very complex and quite amenable to analytical transformation, 
which expresses the relations between three positions in an orbit 
separated by small or moderate intervals, with an accuracy far 
exceeding that of the approximate relations generally used in the 
determination of orbits. 

If we adopt such a unit of time that the acceleration due to the 
sun’s action is unity at a unit’s distance, and denote the vectors* 
drawn from the sun to the body in its three positions by 
$\mathfrak{R}_1, \mathfrak{R}_2, \mathfrak{R}_3$, and the lengths of these vectors 
(the heliocentric distances) by $r_1, r_2, r_3$, the accelerations 
corresponding to the three positions will be represented by 
$-\mathfrak{R}_1/r_1^3$, $-\mathfrak{R}_2/r_2^3$, $-\mathfrak{R}_3/r_3^3$. 
Now the motion between the positions considered may be expressed 
with a high degree of accuracy by an equation of the form 

$$\mathfrak{R}=\mathfrak{A}+t\,\mathfrak{B}+t^2\,\mathfrak{C}+t^3\,\mathfrak{D}+t^4\,\mathfrak{E},$$

having five vector constants. The actual motion rigorously satisfies 
six conditions, viz., if we write $\tau_3$ for the interval of time between the 

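* Vectors, or directed quantities, will be represented in this paper by German capitals. 
The following notations will be used in connection with them: 

The sign $=$ denotes identity in direction as well as length. 

The sign $+$ denotes geometrical addition, or what is called composition in mechanics. 

The sign $-$ denotes reversal of direction, or composition after reversal. 

The notation $\mathfrak{A}.\mathfrak{B}$ denotes the product of the lengths of the vectors and the cosine of 
the angle which they include. It will be called the direct product of $\mathfrak{A}$ and $\mathfrak{B}$. If 
$x, y, z$ are the rectangular components of $\mathfrak{A}$, and $x', y', z'$ those of $\mathfrak{B}$, 

$$\mathfrak{A}.\mathfrak{B}=xx'+yy'+zz'.$$

$\mathfrak{A}.\mathfrak{A}$ may be written $\mathfrak{A}^2$ and called the square of $\mathfrak{A}$. 

The notation $\mathfrak{A}\times\mathfrak{B}$ will be used to denote a vector of which the length is the product 
of the lengths of $\mathfrak{A}$ and $\mathfrak{B}$ and the sine of the angle which they include. Its direction 
is perpendicular to $\mathfrak{A}$ and $\mathfrak{B}$, and on that side on which a rotation from $\mathfrak{A}$ to $\mathfrak{B}$ 
appears counter-clockwise. It will be called the skew product of $\mathfrak{A}$ and $\mathfrak{B}$. If the 
rectangular components of $\mathfrak{A}$ and $\mathfrak{B}$ are $x, y, z$, and $x', y', z'$, those of $\mathfrak{A}\times\mathfrak{B}$ will be 

$$yz'-zy', \qquad zx'-xz', \qquad xy'-yx'.$$

The notation $(\mathfrak{A}\mathfrak{B}\mathfrak{C})$ denotes the volume of the parallelopiped of which three edges are 
obtained by laying off the vectors $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$ from any same point, which volume is to 
be taken positively or negatively, according as the vector $\mathfrak{C}$ falls on the side of the plane 
containing $\mathfrak{A}$ and $\mathfrak{B}$, on which a rotation from $\mathfrak{A}$ to $\mathfrak{B}$ appears counter-clockwise, or on 
the other side. If the rectangular components of $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$ are $x, y, z$; $x', y', z'$; 
$x'', y'', z''$, 

$$(\mathfrak{A}\mathfrak{B}\mathfrak{C})=\begin{vmatrix} x & y & z \\ x' & y' & z' \\ x'' & y'' & z'' \end{vmatrix}.$$

It follows, from the above definitions, that for any vectors $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$ 

$$\mathfrak{A}.\mathfrak{B}=\mathfrak{B}.\mathfrak{A}, \qquad \mathfrak{A}\times\mathfrak{B}=-\mathfrak{B}\times\mathfrak{A},$$
$$(\mathfrak{A}\mathfrak{B}\mathfrak{C})=(\mathfrak{B}\mathfrak{C}\mathfrak{A})=(\mathfrak{C}\mathfrak{A}\mathfrak{B})=-(\mathfrak{A}\mathfrak{C}\mathfrak{B})=-(\mathfrak{C}\mathfrak{B}\mathfrak{A})=-(\mathfrak{B}\mathfrak{A}\mathfrak{C}),$$
and 
$$(\mathfrak{A}\mathfrak{B}\mathfrak{C})=\mathfrak{A}.(\mathfrak{B}\times\mathfrak{C})=\mathfrak{B}.(\mathfrak{C}\times\mathfrak{A})=\mathfrak{C}.(\mathfrak{A}\times\mathfrak{B});$$

also that $\mathfrak{A}.\mathfrak{B}$, $\mathfrak{A}\times\mathfrak{B}$ are distributive functions of $\mathfrak{A}$ and $\mathfrak{B}$, and $(\mathfrak{A}\mathfrak{B}\mathfrak{C})$ a distributive 
function of $\mathfrak{A}$, $\mathfrak{B}$, and $\mathfrak{C}$, for example, that if $\mathfrak{A}=\mathfrak{L}+\mathfrak{M}$, 

$$\mathfrak{A}.\mathfrak{B}=\mathfrak{L}.\mathfrak{B}+\mathfrak{M}.\mathfrak{B}, \qquad \mathfrak{A}\times\mathfrak{B}=\mathfrak{L}\times\mathfrak{B}+\mathfrak{M}\times\mathfrak{B}, \qquad (\mathfrak{A}\mathfrak{B}\mathfrak{C})=(\mathfrak{L}\mathfrak{B}\mathfrak{C})+(\mathfrak{M}\mathfrak{B}\mathfrak{C}),$$

and so for $\mathfrak{B}$ and $\mathfrak{C}$. 

The notation $(\mathfrak{A}\mathfrak{B}\mathfrak{C})$ is identical with that of Lagrange in the Mécanique Analytique, 
except that there its use is limited to unit vectors. The signification of $\mathfrak{R}_1\times\mathfrak{R}_2$ is closely 
related to, but not identical with, that of the notation $[r_1r_2]$ commonly used to denote 
the double area of a triangle determined by two positions in an orbit. 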
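first and second positions, and $\tau_1$ for that between the second and 
third, and set $t=0$ for the second position, 

$$\text{for } t=-\tau_3, \qquad \mathfrak{R}=\mathfrak{R}_1, \qquad \frac{d^2\mathfrak{R}}{dt^2}=-\frac{\mathfrak{R}_1}{r_1^3};$$
$$\text{for } t=0, \qquad \mathfrak{R}=\mathfrak{R}_2, \qquad \frac{d^2\mathfrak{R}}{dt^2}=-\frac{\mathfrak{R}_2}{r_2^3};$$
$$\text{for } t=\tau_1, \qquad \mathfrak{R}=\mathfrak{R}_3, \qquad \frac{d^2\mathfrak{R}}{dt^2}=-\frac{\mathfrak{R}_3}{r_3^3}.$$

We may therefore write with a high degree of approximation 

$$\begin{aligned}
\mathfrak{R}_1 &= \mathfrak{A}-\tau_3\mathfrak{B}+\tau_3^2\mathfrak{C}-\tau_3^3\mathfrak{D}+\tau_3^4\mathfrak{E},\\
\mathfrak{R}_2 &= \mathfrak{A},\\
\mathfrak{R}_3 &= \mathfrak{A}+\tau_1\mathfrak{B}+\tau_1^2\mathfrak{C}+\tau_1^3\mathfrak{D}+\tau_1^4\mathfrak{E},\\
-\frac{\mathfrak{R}_1}{r_1^3} &= 2\mathfrak{C}-6\tau_3\mathfrak{D}+12\tau_3^2\mathfrak{E},\\
-\frac{\mathfrak{R}_2}{r_2^3} &= 2\mathfrak{C},\\
-\frac{\mathfrak{R}_3}{r_3^3} &= 2\mathfrak{C}+6\tau_1\mathfrak{D}+12\tau_1^2\mathfrak{E}.
\end{aligned}$$

From these six equations the five constants $\mathfrak{A}, \mathfrak{B}, \mathfrak{C}, \mathfrak{D}, \mathfrak{E}$ may be 
eliminated, leaving a single equation of the form 

$$A_1\Big(1+\frac{B_1}{r_1^3}\Big)\mathfrak{R}_1-\Big(1-\frac{B_2}{r_2^3}\Big)\mathfrak{R}_2+A_3\Big(1+\frac{B_3}{r_3^3}\Big)\mathfrak{R}_3=0, \tag{1}$$

where 

$$A_1=\frac{\tau_1}{\tau_1+\tau_3}, \qquad A_3=\frac{\tau_3}{\tau_1+\tau_3},$$
$$B_1=\tfrac{1}{12}(-\tau_1^2+\tau_1\tau_3+\tau_3^2), \qquad B_2=\tfrac{1}{12}(\tau_1^2+3\tau_1\tau_3+\tau_3^2), \qquad B_3=\tfrac{1}{12}(\tau_1^2+\tau_1\tau_3-\tau_3^2).$$

This we shall call our fundamental equation. In order to discuss 
its geometrical signification, let us set 

$$n_1=A_1\Big(1+\frac{B_1}{r_1^3}\Big), \qquad n_2=\Big(1-\frac{B_2}{r_2^3}\Big), \qquad n_3=A_3\Big(1+\frac{B_3}{r_3^3}\Big), \tag{2}$$

so that the equation will read 

$$n_1\mathfrak{R}_1-n_2\mathfrak{R}_2+n_3\mathfrak{R}_3=0. \tag{3}$$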
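The accuracy of equation (3) may be illustrated on a circular orbit, for which the exact positions are known. The following Python sketch is an illustration only and not part of the memoir: the function names `gibbs_coefficients` and `residual_on_circle` are assumptions introduced here, and the unit of time is that in which the sun's attraction is unity at unit distance.

```python
import math


def gibbs_coefficients(tau1, tau3):
    """Return (A1, A3, B1, B2, B3) of equation (1).

    tau3 is the interval between the first and second positions,
    tau1 that between the second and third.
    """
    total = tau1 + tau3
    a1 = tau1 / total
    a3 = tau3 / total
    b1 = (-tau1 ** 2 + tau1 * tau3 + tau3 ** 2) / 12.0
    b2 = (tau1 ** 2 + 3.0 * tau1 * tau3 + tau3 ** 2) / 12.0
    b3 = (tau1 ** 2 + tau1 * tau3 - tau3 ** 2) / 12.0
    return a1, a3, b1, b2, b3


def residual_on_circle(tau1, tau3, radius=1.0):
    """Length of n1*R1 - n2*R2 + n3*R3 for three points of a circular orbit."""
    a1, a3, b1, b2, b3 = gibbs_coefficients(tau1, tau3)
    rate = radius ** -1.5                      # angular velocity in these units
    n1 = a1 * (1.0 + b1 / radius ** 3)
    n2 = 1.0 - b2 / radius ** 3
    n3 = a3 * (1.0 + b3 / radius ** 3)
    angles = (-rate * tau3, 0.0, rate * tau1)  # t = -tau3, 0, +tau1
    pts = [(radius * math.cos(v), radius * math.sin(v)) for v in angles]
    rx = n1 * pts[0][0] - n2 * pts[1][0] + n3 * pts[2][0]
    ry = n1 * pts[0][1] - n2 * pts[1][1] + n3 * pts[2][1]
    return math.hypot(rx, ry)
```

For equal intervals of one tenth of the unit of time (rather less than six days for a body at the earth's distance) the residual is a very small fraction of the radius, which agrees with the claim that the equation holds to a high order in the intervals.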


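This expresses that the vector $n_2\mathfrak{R}_2$ is the diagonal of a parallelogram 
of which $n_1\mathfrak{R}_1$ and $n_3\mathfrak{R}_3$ are sides. If we multiply by $\mathfrak{R}_3$ and by $\mathfrak{R}_1$, 
in skew multiplication, we get 

$$n_1\mathfrak{R}_1\times\mathfrak{R}_3-n_2\mathfrak{R}_2\times\mathfrak{R}_3=0, \qquad -n_2\mathfrak{R}_2\times\mathfrak{R}_1+n_3\mathfrak{R}_3\times\mathfrak{R}_1=0, \tag{4}$$

whence 

$$\frac{\mathfrak{R}_2\times\mathfrak{R}_3}{n_1}=\frac{\mathfrak{R}_1\times\mathfrak{R}_3}{n_2}=\frac{\mathfrak{R}_1\times\mathfrak{R}_2}{n_3}. \tag{5}$$

Our equation may therefore be regarded as signifying that the three 
vectors $\mathfrak{R}_1, \mathfrak{R}_2, \mathfrak{R}_3$ lie in one plane, and that the three triangles 
determined each by a pair of these vectors, and usually denoted by 
$[r_2r_3]$, $[r_1r_3]$, $[r_1r_2]$, are proportional to 

$$A_1\Big(1+\frac{B_1}{r_1^3}\Big), \qquad \Big(1-\frac{B_2}{r_2^3}\Big), \qquad A_3\Big(1+\frac{B_3}{r_3^3}\Big).$$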

Since this vector equation is equivalent to three ordinary equations, 
it is evidently sufficient to determine the three positions of the body 
in connection with the conditions that these positions must lie upon 
the lines of sight of three observations. To give analytical expression 
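to these conditions, we may write $\mathfrak{E}_1, \mathfrak{E}_2, \mathfrak{E}_3$ for the vectors drawn 
from the sun to the three positions of the earth (or, more exactly, of 
the observatories where the observations have been made), $\mathfrak{F}_1, \mathfrak{F}_2, \mathfrak{F}_3$ 
for unit vectors drawn in the directions of the body, as observed, 
and $\rho_1, \rho_2, \rho_3$ for the three distances of the body from the places of 
observation. We have then 

$$\mathfrak{R}_1=\mathfrak{E}_1+\rho_1\mathfrak{F}_1, \qquad \mathfrak{R}_2=\mathfrak{E}_2+\rho_2\mathfrak{F}_2, \qquad \mathfrak{R}_3=\mathfrak{E}_3+\rho_3\mathfrak{F}_3. \tag{6}$$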


By substitution of these values our fundamental equation becomes 

$$A_1\Big(1+\frac{B_1}{r_1^3}\Big)(\mathfrak{E}_1+\rho_1\mathfrak{F}_1)-\Big(1-\frac{B_2}{r_2^3}\Big)(\mathfrak{E}_2+\rho_2\mathfrak{F}_2)+A_3\Big(1+\frac{B_3}{r_3^3}\Big)(\mathfrak{E}_3+\rho_3\mathfrak{F}_3)=0, \tag{7}$$

where $\rho_1, \rho_2, \rho_3, r_1, r_2, r_3$ (the geocentric and heliocentric distances) 
are the only unknown quantities. From equations (6) we also get, 
by squaring both members in each, 

$$\begin{aligned}
r_1^2 &= \mathfrak{E}_1^2+2(\mathfrak{E}_1.\mathfrak{F}_1)\rho_1+\rho_1^2,\\
r_2^2 &= \mathfrak{E}_2^2+2(\mathfrak{E}_2.\mathfrak{F}_2)\rho_2+\rho_2^2,\\
r_3^2 &= \mathfrak{E}_3^2+2(\mathfrak{E}_3.\mathfrak{F}_3)\rho_3+\rho_3^2,
\end{aligned} \tag{8}$$

by which the values of $r_1, r_2, r_3$ may be derived from those of 
$\rho_1, \rho_2, \rho_3$, or vice versa. Equations (7) and (8), which are equivalent 
to six ordinary equations, are sufficient to determine the six quantities 
$r_1, r_2, r_3, \rho_1, \rho_2, \rho_3$; or, if we suppose the values of $r_1, r_2, r_3$ in terms 
of $\rho_1, \rho_2, \rho_3$ to be substituted in equation (7), we have a single vector 
equation, from which we may determine the three geocentric distances 
$\rho_1, \rho_2, \rho_3$. It remains to be shown, first, how the numerical solution of the 
equation may be performed, and secondly, how such an approximate 
solution of the actual problem may furnish the basis of a closer 
approximation. 


Solution of the Fundamental Equation. 


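The relations with which we have to do will be rendered a little 
more simple if instead of each geocentric distance we introduce the 
distance of the body from the foot of the perpendicular from the sun 
upon the line of sight. If we set 

$$q_1=\rho_1+(\mathfrak{E}_1.\mathfrak{F}_1), \qquad q_2=\rho_2+(\mathfrak{E}_2.\mathfrak{F}_2), \qquad q_3=\rho_3+(\mathfrak{E}_3.\mathfrak{F}_3), \tag{9}$$
$$p_1^2=\mathfrak{E}_1^2-(\mathfrak{E}_1.\mathfrak{F}_1)^2, \qquad p_2^2=\mathfrak{E}_2^2-(\mathfrak{E}_2.\mathfrak{F}_2)^2, \qquad p_3^2=\mathfrak{E}_3^2-(\mathfrak{E}_3.\mathfrak{F}_3)^2, \tag{10}$$

equations (8) become 

$$r_1^2=q_1^2+p_1^2, \qquad r_2^2=q_2^2+p_2^2, \qquad r_3^2=q_3^2+p_3^2. \tag{11}$$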
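The equivalence of equations (8) and (11) is easily confirmed with numbers. The sketch below is illustrative only: the helper names and the numerical values of the earth vector, the line of sight, and the geocentric distance are assumptions made for the example, not data from the memoir.

```python
import math


def dot(a, b):
    """Direct product of two vectors given by rectangular components."""
    return sum(x * y for x, y in zip(a, b))


def heliocentric_distance(earth, sight, rho):
    """r from equation (8), the length of E + rho*F for a unit vector F."""
    return math.sqrt(dot(earth, earth) + 2.0 * dot(earth, sight) * rho + rho * rho)


def q_and_p_squared(earth, sight, rho):
    """q of equation (9) and p^2 of equation (10)."""
    ef = dot(earth, sight)
    return rho + ef, dot(earth, earth) - ef * ef


earth = (0.3, -0.9, 0.1)   # illustrative sun-to-earth vector
sight = (0.6, 0.0, 0.8)    # illustrative unit vector toward the body
rho = 1.7                  # illustrative geocentric distance
q, p2 = q_and_p_squared(earth, sight, rho)
r = heliocentric_distance(earth, sight, rho)
```

Here `r * r` and `q * q + p2` agree to rounding error. The practical gain is that $p^2$ depends only on the observation, so that $r$ follows from $q$ by a single addition of squares.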
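Let us also set, for brevity, 

$$\begin{aligned}
\mathfrak{S}_1 &= A_1\Big(1+\frac{B_1}{r_1^3}\Big)(\mathfrak{E}_1+\rho_1\mathfrak{F}_1),\\
\mathfrak{S}_2 &= -\Big(1-\frac{B_2}{r_2^3}\Big)(\mathfrak{E}_2+\rho_2\mathfrak{F}_2),\\
\mathfrak{S}_3 &= A_3\Big(1+\frac{B_3}{r_3^3}\Big)(\mathfrak{E}_3+\rho_3\mathfrak{F}_3).
\end{aligned} \tag{12}$$

Then $\mathfrak{S}_1, \mathfrak{S}_2, \mathfrak{S}_3$ may be regarded as functions respectively of 
$\rho_1, \rho_2, \rho_3$, therefore of $q_1, q_2, q_3$, and if we set 

$$\mathfrak{S}'=\frac{d\mathfrak{S}_1}{dq_1}, \qquad \mathfrak{S}''=\frac{d\mathfrak{S}_2}{dq_2}, \qquad \mathfrak{S}'''=\frac{d\mathfrak{S}_3}{dq_3}, \tag{13}$$

and 

$$\mathfrak{S}=\mathfrak{S}_1+\mathfrak{S}_2+\mathfrak{S}_3, \tag{14}$$

we shall have 

$$d\mathfrak{S}=\mathfrak{S}'dq_1+\mathfrak{S}''dq_2+\mathfrak{S}'''dq_3. \tag{15}$$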
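To determine the value of $\mathfrak{S}'$, we get by differentiation 

$$\mathfrak{S}'=A_1\Big(1+\frac{B_1}{r_1^3}\Big)\mathfrak{F}_1-A_1\frac{3B_1}{r_1^4}\frac{dr_1}{dq_1}(\mathfrak{E}_1+\rho_1\mathfrak{F}_1). \tag{16}$$

But by (11) 

$$\frac{dr_1}{dq_1}=\frac{q_1}{r_1}. \tag{17}$$

Therefore 

$$\begin{aligned}
\mathfrak{S}' &= A_1\Big(1+\frac{B_1}{r_1^3}\Big)\mathfrak{F}_1-\frac{3B_1q_1}{r_1^5(1+B_1r_1^{-3})}\,\mathfrak{S}_1,\\
\mathfrak{S}'' &= -\Big(1-\frac{B_2}{r_2^3}\Big)\mathfrak{F}_2+\frac{3B_2q_2}{r_2^5(1-B_2r_2^{-3})}\,\mathfrak{S}_2,\\
\mathfrak{S}''' &= A_3\Big(1+\frac{B_3}{r_3^3}\Big)\mathfrak{F}_3-\frac{3B_3q_3}{r_3^5(1+B_3r_3^{-3})}\,\mathfrak{S}_3.
\end{aligned} \tag{18}$$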

Now if any values of $q_1, q_2, q_3$ (either assumed or obtained by a 
previous approximation) give a certain residual $\mathfrak{S}$ (which would be 
zero if the values of $q_1, q_2, q_3$ satisfied the fundamental equation), and 
we wish to find the corrections $\Delta q_1, \Delta q_2, \Delta q_3$, which must be added 
to $q_1, q_2, q_3$ to reduce the residual to zero, we may apply equation (15) 
to these finite differences, and will have approximately, when these 
differences are not very large, 

$$-\mathfrak{S}=\mathfrak{S}'\Delta q_1+\mathfrak{S}''\Delta q_2+\mathfrak{S}'''\Delta q_3. \tag{19}$$


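This gives* 

$$\Delta q_1=-\frac{(\mathfrak{S}\mathfrak{S}''\mathfrak{S}''')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad \Delta q_2=-\frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad \Delta q_3=-\frac{(\mathfrak{S}\mathfrak{S}'\mathfrak{S}'')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}. \tag{20}$$

From the corrected values of $q_1, q_2, q_3$ we may calculate a new residual $\mathfrak{S}$, 
and from that determine another correction for each of the quantities 
$q_1, q_2, q_3$. 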
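Equations (20) amount to solving the three linear equations contained in (19) by means of triple products. A minimal Python sketch follows; the function names and the numerical vectors used to try it are assumptions made for illustration and do not come from the memoir.

```python
def cross(a, b):
    """Skew product of two vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Direct product of two vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def triple(a, b, c):
    """The volume (a b c) = a . (b x c)."""
    return dot(a, cross(b, c))


def linear_corrections(s, s1, s2, s3):
    """Corrections (dq1, dq2, dq3) of equation (20).

    s is the residual; s1, s2, s3 stand for S', S'', S''' and must
    not be coplanar, since their triple product is the divisor.
    """
    det = triple(s1, s2, s3)
    return (-triple(s, s2, s3) / det,
            -triple(s, s3, s1) / det,
            -triple(s, s1, s2) / det)
```

A residual built as $-(\mathfrak{S}'\Delta q_1+\mathfrak{S}''\Delta q_2+\mathfrak{S}'''\Delta q_3)$ from known corrections returns those corrections, which is a convenient control on the arithmetic.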

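It will sometimes be worth while to use formulæ a little less simple 
for the sake of a more rapid approximation. Instead of equation (19) 
we may write, with a higher degree of accuracy, 

$$-\mathfrak{S}=\mathfrak{S}'\Delta q_1+\mathfrak{S}''\Delta q_2+\mathfrak{S}'''\Delta q_3+\tfrac12\mathfrak{T}'(\Delta q_1)^2+\tfrac12\mathfrak{T}''(\Delta q_2)^2+\tfrac12\mathfrak{T}'''(\Delta q_3)^2, \tag{21}$$

where 

$$\begin{aligned}
\mathfrak{T}' &= \frac{d^2\mathfrak{S}_1}{dq_1^2}=2A_1B_1\frac{d(r_1^{-3})}{dq_1}\,\mathfrak{F}_1+\frac{B_1}{1+B_1r_1^{-3}}\,\frac{d^2(r_1^{-3})}{dq_1^2}\,\mathfrak{S}_1,\\
\mathfrak{T}'' &= \frac{d^2\mathfrak{S}_2}{dq_2^2}=2B_2\frac{d(r_2^{-3})}{dq_2}\,\mathfrak{F}_2-\frac{B_2}{1-B_2r_2^{-3}}\,\frac{d^2(r_2^{-3})}{dq_2^2}\,\mathfrak{S}_2,\\
\mathfrak{T}''' &= \frac{d^2\mathfrak{S}_3}{dq_3^2}=2A_3B_3\frac{d(r_3^{-3})}{dq_3}\,\mathfrak{F}_3+\frac{B_3}{1+B_3r_3^{-3}}\,\frac{d^2(r_3^{-3})}{dq_3^2}\,\mathfrak{S}_3.
\end{aligned} \tag{22}$$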


It is evident that $\mathfrak{T}''$ is generally many times greater than $\mathfrak{T}'$ or $\mathfrak{T}'''$, 
the factor $B_2$, in the case of equal intervals, being exactly ten times as 
great as $A_1B_1$ or $A_3B_3$. This shows, in the first place, that the accurate 
determination of $\Delta q_2$ is of the most importance for the subsequent 
approximations. It also shows that we may attain nearly the same 
accuracy in writing 

$$-\mathfrak{S}=\mathfrak{S}'\Delta q_1+\mathfrak{S}''\Delta q_2+\mathfrak{S}'''\Delta q_3+\tfrac12\mathfrak{T}''\Delta q_2^2. \tag{23}$$

We may, however, often do a little better than this without using 
a more complicated equation. For $\mathfrak{T}'+\mathfrak{T}'''$ may be estimated very 
roughly as equal to $\tfrac15\mathfrak{T}''$. Whenever, therefore, $\Delta q_1$ and $\Delta q_3$ are about 
as large as $\Delta q_2$, as is often the case, it may be a little better to use the 
coefficient $\tfrac{6}{10}$, instead of $\tfrac12$ in the last term. 

For $\Delta q_2$, then, we have the equation 

$$-(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')=(\mathfrak{S}''\mathfrak{S}'''\mathfrak{S}')\Delta q_2+\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')\Delta q_2^2. \tag{24}$$

(Z’S’"S’) is easily computed from the formula 
(2"S"S')= -(- pda 2, )((SS'S")+&S"S)) 
(25) 

De Ya Wht 
- Qos? (1 +3) (8,9 S), 
which may be derived from equations (18) and (22). 

* These equations are obtained by taking the direct products of both members of the 
preceding equation with $\mathfrak{S}''\times\mathfrak{S}'''$, $\mathfrak{S}'''\times\mathfrak{S}'$, and $\mathfrak{S}'\times\mathfrak{S}''$, respectively. See footnote 
on page 119. 

The quadratic equation (24) gives two values of the correction to be 
applied to the position of the body. When they are not too large, 
they will belong to two different solutions of the problem, generally 
to the two least removed from the values assumed. But a very large 
value of $\Delta q_2$ must not be regarded as affording any trustworthy 
indication of a solution of the problem. In the majority of cases we 
only care for one of the roots of the equation, which is distinguished 
by being very small, and which will be most easily calculated by a 
small correction to the value which we get by neglecting the quadratic 
term.* 

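When a comet is somewhat near the earth we may make use of the 
fact that the earth’s orbit is one solution of the problem, i.e., that $-\rho_2$ 
is one value of $\Delta q_2$, to save the trifling labor of computing the value 
of $(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')$. For it is evident from the theory of equations that 
if $-\rho_2$ and $z$ are the two roots, 

$$-\rho_2+z=-\frac{(\mathfrak{S}''\mathfrak{S}'''\mathfrak{S}')}{\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')}, \qquad -\rho_2z=\frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')}.$$

Eliminating $(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')$, we have 

$$(\rho_2-z)(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')=-\rho_2z\,(\mathfrak{S}''\mathfrak{S}'''\mathfrak{S}'),$$

whence 

$$\frac{1}{z}=\frac{1}{\rho_2}-\frac{(\mathfrak{S}''\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}.$$

Now $-\dfrac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}''\mathfrak{S}'''\mathfrak{S}')}$ is the value of $\Delta q_2$, which we obtain if we neglect 
the quadratic term in equation (24). If we call this value $[\Delta q_2]$, we 
have for the more exact value† 

$$\Delta q_2=\frac{[\Delta q_2]}{1+\dfrac{[\Delta q_2]}{\rho_2}}. \tag{26}$$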
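Equation (26) is a one-line computation. The Python sketch below is illustrative (the function name `refined_correction` is an assumption); it is tried on the figures quoted in the footnotes for Swift's comet, and on an artificial quadratic whose roots are known.

```python
def refined_correction(linear_dq2, rho2):
    """Equation (26): improve the linear estimate [dq2] by using the fact
    that -rho2 (the earth's place) is the other root of the quadratic."""
    return linear_dq2 / (1.0 + linear_dq2 / rho2)


# Figures from the footnotes: [dq2] = -.079 with rho2 = .25
swift = refined_correction(-0.079, 0.25)

# Artificial check: K*(x + 0.25)*(x + 0.1) = K*x^2 + D*x + N with K = 2
D, N = 0.7, 0.05
artificial = refined_correction(-N / D, 0.25)   # should recover the root -0.1
```

The first value comes out close to $-.1155$, as stated in the second footnote. The second recovers the smaller root exactly, because (26) depends only on the ratio of the constant and linear coefficients and on the known root.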
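The quantities $\Delta q_1$ and $\Delta q_3$ might be calculated by the equations 

$$\begin{aligned}
-(\mathfrak{S}\mathfrak{S}''\mathfrak{S}''') &= (\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')\Delta q_1+\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}''\mathfrak{S}''')\Delta q_2^2,\\
-(\mathfrak{S}\mathfrak{S}'\mathfrak{S}'') &= (\mathfrak{S}'''\mathfrak{S}'\mathfrak{S}'')\Delta q_3+\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'\mathfrak{S}'')\Delta q_2^2.
\end{aligned} \tag{27}$$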
* In the case of Swift’s comet (V, 1880), the writer found by the quadratic equation 
$-.247$ and $-.116$ for corrections of the assumed geocentric distance $.250$. The first of 
these numbers gives an approximation to the position of the earth; the second to that 
of the comet, viz., the geocentric distance $.134$ instead of the true value $.1333$. The 
coefficient $\tfrac{6}{10}$ was used in the quadratic equation; with the coefficient $\tfrac12$ the approxi- 
mations would not be quite so good. The value of the correction obtained by neglecting 
the quadratic term was $-.079$, which indicates that the approximations (in this very 
critical case) would be quite tedious without the use of the quadratic term. 

† In the case mentioned in the preceding footnote, from $[\Delta q_2]=-.079$ and $\rho_2=.25$ 
we get $\Delta q_2=-.1155$, which is sensibly the same value as that obtained by calculating 
the quadratic term. 

But a little examination will show that the coefficients of $\Delta q_2^2$ in 
these equations will not generally have very different values from the 
coefficient of the same quantity in equation (24). We may therefore 
write with sufficient accuracy 

$$\Delta q_1=[\Delta q_1]+\Delta q_2-[\Delta q_2], \qquad \Delta q_3=[\Delta q_3]+\Delta q_2-[\Delta q_2], \tag{28}$$

where $[\Delta q_1]$, $[\Delta q_2]$, $[\Delta q_3]$ denote the values obtained from equations (20). 

In making successive corrections of the distances $q_1, q_2, q_3$, it will 
not be necessary to recalculate the values of $\mathfrak{S}', \mathfrak{S}'', \mathfrak{S}'''$, when these 
have been calculated from fairly good values of $q_1, q_2, q_3$. But when, 
as is generally the case, the first assumption is only a rude guess, the 
values of $\mathfrak{S}', \mathfrak{S}'', \mathfrak{S}'''$ should be recalculated after one or two corrections 
of $q_1, q_2, q_3$. To get the best results when we do not recalculate $\mathfrak{S}'$, 
$\mathfrak{S}''$, $\mathfrak{S}'''$, we may proceed as follows: Let $\mathfrak{S}', \mathfrak{S}'', \mathfrak{S}'''$ denote the values 
which have been calculated; $Dq_1, Dq_2, Dq_3$ respectively, the sum of 
the corrections of each of the quantities $q_1, q_2, q_3$, which have been 
made since the calculation of $\mathfrak{S}', \mathfrak{S}'', \mathfrak{S}'''$; $\mathfrak{S}$ the residual after all the 
corrections of $q_1, q_2, q_3$, which have been made; and $\Delta q_1, \Delta q_2, \Delta q_3$ 
the remaining corrections which we are seeking. We have, then, 
very nearly 

$$\begin{aligned}
-\mathfrak{S} = {} & \{\mathfrak{S}'+\mathfrak{T}'(Dq_1+\tfrac12\Delta q_1)\}\Delta q_1+\{\mathfrak{S}''+\mathfrak{T}''(Dq_2+\tfrac12\Delta q_2)\}\Delta q_2\\
& +\{\mathfrak{S}'''+\mathfrak{T}'''(Dq_3+\tfrac12\Delta q_3)\}\Delta q_3.
\end{aligned} \tag{29}$$

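The same considerations which we applied to equation (21) enable 
us to simplify this equation also, and to write with a fair degree of 
accuracy 

$$-(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')=\{(\mathfrak{S}''\mathfrak{S}'''\mathfrak{S}')+\tfrac{6}{10}(\mathfrak{T}''\mathfrak{S}'''\mathfrak{S}')(2Dq_2+\Delta q_2)\}\Delta q_2, \tag{30}$$
$$\Delta q_1=[\Delta q_1]+\Delta q_2-[\Delta q_2], \qquad \Delta q_3=[\Delta q_3]+\Delta q_2-[\Delta q_2], \tag{31}$$

where 

$$[\Delta q_1]=-\frac{(\mathfrak{S}\mathfrak{S}''\mathfrak{S}''')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad [\Delta q_2]=-\frac{(\mathfrak{S}\mathfrak{S}'''\mathfrak{S}')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}, \qquad [\Delta q_3]=-\frac{(\mathfrak{S}\mathfrak{S}'\mathfrak{S}'')}{(\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''')}. \tag{32}$$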

Correction of the Fundamental Equation. 


When we have thus determined, by the numerical solution of our 
fundamental equation, approximate values of the three positions of 
the body, it will always be possible to apply a small numerical 
correction to the equation, so as to make it agree exactly with the 
laws of elliptic motion in a fictitious case differing but little from 
the actual. After such a correction the equation will evidently apply 
to the actual case with a much higher degree of approximation. 

There is room for great diversity in the application of this principle. 
The method which appears to the writer the most simple and direct is 
the following, in which the correction of the intervals for aberration 
is combined with the correction required by the approximate nature of 
the equation.* 

The solution of the fundamental equation gives us three points, 
which must necessarily lie in one plane with the sun, and in the lines 
of sight of the several observations. Through these points we may 
pass an ellipse, and calculate the intervals of time required by the 
exact laws of elliptic motion for the passage of the body between 
them. If these calculated intervals should be identical with the given 
intervals, corrected for aberration, we would evidently have the true 
solution of the problem. But suppose, to fix our ideas, that the 
calculated intervals are a little too long. It is evident that if we 
repeat our calculations, using in our fundamental equation intervals 
shortened in the same ratio as the calculated intervals have come out 
too long, the intervals calculated from the second solution of the 
fundamental equation must agree almost exactly with the desired 
values. If necessary, this process may be repeated, and thus any 
required degree of accuracy may be obtained, whenever the solution of 
the uncorrected equation gives an approximation to the true positions. 
For this it is necessary that the intervals should not be too great. It 
appears, however, from the results of the example of Ceres, given 
hereafter, in which the heliocentric motion exceeds 62° but the 
calculated values of the intervals of time differ from the given values 
by little more than one part in two thousand, that we have here not 
approached the limit of the application of our formula. 

In the usual terminology of the subject, the fundamental equation 
with intervals uncorrected for aberration represents the first hypothesis; 
the same equation with the intervals affected by certain numerical 
coefficients (differing little from unity) represents the second hypothesis; 
the third hypothesis, should such be necessary, is represented by a 
similar equation with corrected coefficients, etc. 

In the process indicated there are certain economies of labor which 
should not be left unmentioned, and certain precautions to be observed 
in order that the neglected figures in our computations may not 
unduly influence the result. 

* When an approximate orbit is known in advance, we may correct the fundamental 
equation at once. The formulæ will be given in the Summary, § XII. 

It is evident, in the first place, that for the correction of our 
fundamental equation we need not trouble ourselves with the position of the 
orbit in the solar system. The intervals of time, which determine this 
correction, depend only on the three heliocentric distances $r_1, r_2, r_3$ and 
the two heliocentric angles, which will be represented by $v_2-v_1$ and 
$v_3-v_2$, if we write $v_1, v_2, v_3$ for the true anomalies. These angles 
($v_2-v_1$ and $v_3-v_2$) may be determined from $r_1, r_2, r_3$ and $n_1, n_2, n_3$, 
and therefore from $r_1, r_2, r_3$ and the given intervals. For our 
fundamental equation, which may be written 

$$n_1\mathfrak{R}_1-n_2\mathfrak{R}_2+n_3\mathfrak{R}_3=0, \tag{33}$$

indicates that we may form a triangle in which the lengths of the 
sides shall be $n_1r_1$, $n_2r_2$, and $n_3r_3$ (let us say for brevity, $s_1, s_2, s_3$), and 
the directions of the sides parallel with the three heliocentric directions 
of the body. The angles opposite $s_1$ and $s_3$ will be respectively $v_3-v_2$ 
and $v_2-v_1$. We have, therefore, by a well-known formula, 

$$\begin{aligned}
\tan^2\tfrac12(v_3-v_2) &= \frac{(s_1-s_2+s_3)(s_1+s_2-s_3)}{(s_1+s_2+s_3)(-s_1+s_2+s_3)},\\
\tan^2\tfrac12(v_2-v_1) &= \frac{(-s_1+s_2+s_3)(s_1-s_2+s_3)}{(s_1+s_2+s_3)(s_1+s_2-s_3)}.
\end{aligned} \tag{34}$$


As soon, therefore, as the solution of our fundamental equation 
has given a sufficient approximation to the values of $r_1, r_2, r_3$ (say 
five- or six-figure values, if our final result is to be as exact as 
seven-figure logarithms can make it), we calculate $n_1, n_2, n_3$ with 
seven-figure logarithms by equations (2), and the heliocentric angles 
by equations (34). 
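The half-angle formulæ for the two heliocentric angles can be tried on a triangle whose angles are known beforehand. The following Python sketch is an illustration only; the function name `heliocentric_angles` is an assumption, and the sides are taken as given numbers rather than computed from observations.

```python
import math


def heliocentric_angles(s1, s2, s3):
    """Return (v2 - v1, v3 - v2) in radians from the sides s1, s2, s3.

    The angle opposite s1 is v3 - v2 and that opposite s3 is v2 - v1,
    each found from the tangent of its half.
    """
    total = s1 + s2 + s3
    x1 = -s1 + s2 + s3
    x2 = s1 - s2 + s3          # the small quantity s1 - s2 + s3
    x3 = s1 + s2 - s3
    v3_minus_v2 = 2.0 * math.atan(math.sqrt(x2 * x3 / (total * x1)))
    v2_minus_v1 = 2.0 * math.atan(math.sqrt(x1 * x2 / (total * x3)))
    return v2_minus_v1, v3_minus_v2
```

For instance, sides proportional to $\sin 0.5$, $\sin 0.8$, $\sin 0.3$ (taken as $s_1, s_2, s_3$) should return the angles $0.3$ and $0.5$ radians, by the law of sines. Both formulæ contain the factor $s_1-s_2+s_3$, which is why the text insists that a single value of it be used throughout.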

The semi-parameter corresponding to these values of the heliocentric 
distances and angles is given by the equation 

$$p=\frac{n_1r_1-n_2r_2+n_3r_3}{n_1-n_2+n_3}. \tag{35}$$

The expression $n_1-n_2+n_3$, which occurs in the value of the semi- 
parameter, and the expression $n_1r_1-n_2r_2+n_3r_3$, or $s_1-s_2+s_3$, which 
occurs both in the value of the semi-parameter and in the formulæ for 
determining the heliocentric angles, represent small quantities of the 
second order (if we call the heliocentric angles small quantities of the 
first order), and cannot be very accurately determined from approxi- 
mate numerical values of their separate terms. The first of these 
quantities may, however, be determined accurately by the formula 

$$n_1-n_2+n_3=\frac{A_1B_1}{r_1^3}+\frac{B_2}{r_2^3}+\frac{A_3B_3}{r_3^3}. \tag{36}$$


With respect to the quantity $s_1-s_2+s_3$, a little consideration will show 
that if we are careful to use the same value wherever the expression 
occurs, both in the formulæ for the heliocentric angles and for the 
semi-parameter, the inaccuracy of the determination of this value from 
the cause mentioned will be of no consequence in the process of 
correcting the fundamental equation. For although the logarithm 
of $s_1-s_2+s_3$ as calculated by seven-figure logarithms from $r_1, r_2, r_3$ 
may be accurate only to four or five figures, we may regard it as 
absolutely correct if we make a very small change in the value of one 
of the heliocentric distances (say $r_2$). We need not trouble ourselves 
farther about this change, for it will be of a magnitude which we 
neglect in computations with seven-figure tables. That the helio- 
centric angles thus determined may not agree as closely as they might 
with the positions on the lines of sight determined by the first solution 
of the fundamental equation is of no especial consequence in the 
correction of the fundamental equation, which only requires the exact 
fulfilment of two conditions, viz., that our values of the heliocentric 
distances and angles shall have the relations required by the funda- 
mental equation to the given intervals of time, and that they shall 
have the relations required by the exact laws of elliptic motion to 
the calculated intervals of time. The third condition, that none of 
these values shall differ too widely from the actual values, is of a 
looser character. 

After the determination of the heliocentric angles and the semi- 
parameter, the eccentricity and the true anomalies of the three 
positions may next be determined, and from these the intervals of 
time. These processes require no especial notice. The appropriate 
formulæ will be given in the Summary of Formulæ. 


Determination of the Orbit from the Three Positions and the 
Intervals of Time. 


The values of the semi-parameter and the heliocentric angles as 
given in the preceding paragraphs depend upon the quantity $s_1-s_2+s_3$, 
the numerical determination of which from $s_1$, $s_2$, and $s_3$ is critical to 
the second degree when the heliocentric angles are small. This was 
of no consequence in the process which we have called the correction 
of the fundamental equation. But for the actual determination of the 
orbit from the positions given by the corrected equation—or by the 
uncorrected equation, when we judge that to be sufficient—a more 
accurate determination of this quantity will generally be necessary. 
This may be obtained in different ways, of which the following is 
perhaps the most simple. Let us set 

$$\mathfrak{S}_4=\mathfrak{S}_3-\mathfrak{S}_1, \tag{37}$$

and $s_4$ for the length of the vector $\mathfrak{S}_4$, obtained by taking the square 
root of the sum of the squares of the components of the vector. It is 
evident that $s_2$ is the longer and $s_4$ the shorter diagonal of a parallelo- 
gram of which the sides are $s_1$ and $s_3$. The area of the triangle having 
the sides $s_1, s_2, s_3$ is therefore equal to that of the triangle having the 
sides $s_1, s_3, s_4$, each being one-half of the parallelogram. This gives 

$$\begin{aligned}
& (s_1+s_2+s_3)(-s_1+s_2+s_3)(s_1-s_2+s_3)(s_1+s_2-s_3)\\
& \quad = (s_1+s_3+s_4)(-s_1+s_3+s_4)(s_1-s_3+s_4)(s_1+s_3-s_4),
\end{aligned} \tag{38}$$

and 

$$s_1-s_2+s_3=\frac{(s_1+s_3+s_4)(-s_1+s_3+s_4)(s_1-s_3+s_4)(s_1+s_3-s_4)}{(s_1+s_2+s_3)(-s_1+s_2+s_3)(s_1+s_2-s_3)}. \tag{39}$$

The numerical determination of this value of $s_1-s_2+s_3$ is critical 
only to the first degree. 
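The relation between the two triangles may be tried numerically. In the Python sketch below the function name `small_difference` and the two plane vectors standing for $\mathfrak{S}_1$ and $\mathfrak{S}_3$ are assumptions made for illustration; with modern floating-point arithmetic the direct difference is itself accurate, so the example only confirms that the two expressions agree.

```python
import math


def small_difference(s1, s2, s3, s4):
    """s1 - s2 + s3 obtained from the equality of the two triangle areas,
    where s4 is the length of the shorter diagonal S3 - S1."""
    numerator = ((s1 + s3 + s4) * (-s1 + s3 + s4)
                 * (s1 - s3 + s4) * (s1 + s3 - s4))
    denominator = (s1 + s2 + s3) * (-s1 + s2 + s3) * (s1 + s2 - s3)
    return numerator / denominator


vec1 = (0.5, -0.03)            # illustrative S1
vec3 = (0.45, 0.04)            # illustrative S3
s1 = math.hypot(*vec1)
s3 = math.hypot(*vec3)
s2 = math.hypot(vec1[0] + vec3[0], vec1[1] + vec3[1])   # longer diagonal
s4 = math.hypot(vec3[0] - vec1[0], vec3[1] - vec1[1])   # shorter diagonal
stable = small_difference(s1, s2, s3, s4)
direct = s1 - s2 + s3
```

In a computation with a fixed number of figures, as with seven-figure logarithms, the advantage is that no factor of the quotient is a small quantity of the second order formed by subtracting nearly equal numbers.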

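The eccentricity and the true anomalies may be determined in the 
same way as in the correction of the formula. The position of 
the orbit in space may be derived from the following considerations. 
The vector $-\mathfrak{S}_2$ is directed from the sun toward the second position 
of the body; the vector $\mathfrak{S}_4$ from the first to the third position. 
If we set 

$$\mathfrak{S}_5=\mathfrak{S}_4-\frac{\mathfrak{S}_4.\mathfrak{S}_2}{s_2^2}\,\mathfrak{S}_2, \tag{40}$$

the vector $\mathfrak{S}_5$ will be in the plane of the orbit, perpendicular to $-\mathfrak{S}_2$ 
and on the side toward which anomalies increase. If we write $s_5$ for 
the length of $\mathfrak{S}_5$, 

$$-\frac{\mathfrak{S}_2}{s_2} \quad\text{and}\quad \frac{\mathfrak{S}_5}{s_5}$$

will be unit vectors. Let $\mathfrak{X}$ and $\mathfrak{Y}$ be unit vectors determining the 
position of the orbit, $\mathfrak{X}$ being drawn from the sun toward the peri- 
helion, and $\mathfrak{Y}$ at right angles to $\mathfrak{X}$, in the plane of the orbit, and on 
the side toward which anomalies increase. Then 

$$\mathfrak{X}=-\cos v_2\,\frac{\mathfrak{S}_2}{s_2}-\sin v_2\,\frac{\mathfrak{S}_5}{s_5}, \tag{41}$$
$$\mathfrak{Y}=-\sin v_2\,\frac{\mathfrak{S}_2}{s_2}+\cos v_2\,\frac{\mathfrak{S}_5}{s_5}. \tag{42}$$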

The time of perihelion passage ($T$) may be determined from any 
one of the observations by the equation 

$$\frac{k}{a^{3/2}}(t-T)=E-e\sin E, \tag{43}$$

the eccentric anomaly $E$ being calculated from the true anomaly $v$. 
The interval $t-T$ in this equation is to be measured in days. A 
better value of $T$ may be found by averaging the three values given 
by the separate observations, with such weights as the circumstances 
may suggest. But any considerable differences in the three values 
of $T$ would indicate the necessity of a second correction of the 
formula, and furnish the basis for it. 
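The computation of $t-T$ from a true anomaly may be sketched as follows. This Python fragment is an illustration and not the author's computing scheme: the function names are assumptions, the orbit is assumed elliptic ($0 \le e < 1$), the mass of the body is neglected, and $k$ is the Gaussian constant whose logarithm is quoted in the Summary.

```python
import math

GAUSS_K = 0.01720209895   # Gaussian constant; log10 k = 8.2355814 - 10


def eccentric_anomaly(v, e):
    """Eccentric anomaly E (radians) from the true anomaly v, for 0 <= e < 1."""
    return 2.0 * math.atan2(math.sqrt(1.0 - e) * math.sin(v / 2.0),
                            math.sqrt(1.0 + e) * math.cos(v / 2.0))


def days_since_perihelion(v, e, a):
    """t - T in days from Kepler's equation, with a in astronomical units."""
    big_e = eccentric_anomaly(v, e)
    return (big_e - e * math.sin(big_e)) * a ** 1.5 / GAUSS_K
```

Applying `days_since_perihelion` to each of the three observations, and subtracting the results from the observed times, gives the three values of $T$ whose agreement the text proposes as a test of the solution.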
For the calculation of an ephemeris we have 

$$\mathfrak{R}=-ae\,\mathfrak{X}+\cos E\,a\,\mathfrak{X}+\sin E\,b\,\mathfrak{Y} \tag{44}$$

in connection with the preceding equation. 
Sometimes it may be worth while to make the calculations for the 
correction of the formula in the slightly longer form indicated for 
the determination of the orbit. This will be the case when we wish 
simultaneously to correct the formula for its theoretical imperfection, 
and to correct the observations by comparison with others not too 
remote. The rough approximation to the orbit given by the un- 
corrected formula may be sufficient for this purpose. In fact, for 
observations separated by very small intervals, the imperfection of 
the uncorrected formula will be likely to affect the orbit less than 
the errors of the observations. 

The computer may prefer to determine the orbit from the first and 
third heliocentric positions with their times. This process, which has 
certain advantages, is perhaps a little longer than that here given, and 
does not lend itself quite so readily to successive improvements of the 
hypothesis. When it is desired to derive an improved hypothesis 
from an orbit thus determined, the formulæ in § XII of the summary 
may be used. 


SUMMARY OF FORMULÆ 
WITH DIRECTIONS FOR USE. 
(For the case in which an approximate orbit is known in advance, see XII.) 

I. 

Preliminary computations relating to the intervals of time. 

$t_1, t_2, t_3$ = times of the observations in days. 

$\log k = 8.2355814$ (after Gauss) 

$$\tau_1=k(t_3-t_2), \qquad \tau_3=k(t_2-t_1),$$
$$A_1=\frac{t_3-t_2}{t_3-t_1}, \qquad A_3=\frac{t_2-t_1}{t_3-t_1},$$
$$B_1=\frac{-\tau_1^2+\tau_1\tau_3+\tau_3^2}{12}, \qquad B_2=\frac{\tau_1^2+3\tau_1\tau_3+\tau_3^2}{12}, \qquad B_3=\frac{\tau_1^2+\tau_1\tau_3-\tau_3^2}{12}.$$

For control: $A_1B_1+B_2+A_3B_3=\tfrac12\tau_1\tau_3$. 

II. 

Preliminary computations relating to the first observation. 

$X_1, Y_1, Z_1$ (components of $\mathfrak{E}_1$) = the heliocentric coordinates of the 
earth, increased by the geocentric coordinates of the observatory. 

$\xi_1, \eta_1, \zeta_1$ (components of $\mathfrak{F}_1$) = the direction-cosines of the observed 
position, corrected for the aberration of the fixed stars. 

$$\mathfrak{E}_1^2=X_1^2+Y_1^2+Z_1^2, \qquad (\mathfrak{E}_1.\mathfrak{F}_1)=X_1\xi_1+Y_1\eta_1+Z_1\zeta_1, \qquad p_1^2=\mathfrak{E}_1^2-(\mathfrak{E}_1.\mathfrak{F}_1)^2.$$

Preliminary computations relating to the second and third 
observations. 

The formulæ are entirely analogous to those relating to the first 
observation, the quantities being distinguished by the proper suffixes. 

III. 

Equations of the first hypothesis. 


When the preceding quantities have been computed, their numerical 
values (or their logarithms, when more convenient for computation,) 
are to be substituted in the following equations : 


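Components of $\mathfrak{S}_1$.   (III₁)

$q_1 = \rho_1 + (\mathfrak{E}_1.\mathfrak{F}_1)$
$r_1^2 = q_1^2 + p_1^2$
$R_1 = \dfrac{B_1}{r_1^3}$
$\alpha_1 = A_1\xi_1(1+R_1)\left(q_1 + \dfrac{X_1}{\xi_1} - (\mathfrak{E}_1.\mathfrak{F}_1)\right)$
$\beta_1 = A_1\eta_1(1+R_1)\left(q_1 + \dfrac{Y_1}{\eta_1} - (\mathfrak{E}_1.\mathfrak{F}_1)\right)$
$\gamma_1 = A_1\zeta_1(1+R_1)\left(q_1 + \dfrac{Z_1}{\zeta_1} - (\mathfrak{E}_1.\mathfrak{F}_1)\right)$
For control:   $s_1^2 = \alpha_1^2 + \beta_1^2 + \gamma_1^2 = A_1^2(1+R_1)^2 r_1^2$

Components of $\mathfrak{S}'$.   (III′)

$P' = \dfrac{3R_1q_1}{(1+R_1)\,r_1^2}$
$\alpha' = A_1\xi_1 + A_1\xi_1R_1 - P'\alpha_1$
$\beta' = A_1\eta_1 + A_1\eta_1R_1 - P'\beta_1$
$\gamma' = A_1\zeta_1 + A_1\zeta_1R_1 - P'\gamma_1$

Components of $\mathfrak{S}_2$.   (III₂)

$q_2 = \rho_2 + (\mathfrak{E}_2.\mathfrak{F}_2)$
$r_2^2 = q_2^2 + p_2^2$
$R_2 = \dfrac{B_2}{r_2^3}$
$\alpha_2 = -\xi_2(1-R_2)\left(q_2 + \dfrac{X_2}{\xi_2} - (\mathfrak{E}_2.\mathfrak{F}_2)\right)$
$\beta_2 = -\eta_2(1-R_2)\left(q_2 + \dfrac{Y_2}{\eta_2} - (\mathfrak{E}_2.\mathfrak{F}_2)\right)$
$\gamma_2 = -\zeta_2(1-R_2)\left(q_2 + \dfrac{Z_2}{\zeta_2} - (\mathfrak{E}_2.\mathfrak{F}_2)\right)$
For control:   $s_2^2 = \alpha_2^2 + \beta_2^2 + \gamma_2^2 = (1-R_2)^2 r_2^2$

Components of $\mathfrak{S}''$.   (III″)

$P'' = \dfrac{3R_2q_2}{(1-R_2)\,r_2^2}$
$\alpha'' = -\xi_2 + \xi_2R_2 + P''\alpha_2$
$\beta'' = -\eta_2 + \eta_2R_2 + P''\beta_2$
$\gamma'' = -\zeta_2 + \zeta_2R_2 + P''\gamma_2$

Components of $\mathfrak{S}_3$.   (III₃)

$q_3 = \rho_3 + (\mathfrak{E}_3.\mathfrak{F}_3)$
$r_3^2 = q_3^2 + p_3^2$
$R_3 = \dfrac{B_3}{r_3^3}$
$\alpha_3 = A_3\xi_3(1+R_3)\left(q_3 + \dfrac{X_3}{\xi_3} - (\mathfrak{E}_3.\mathfrak{F}_3)\right)$
$\beta_3 = A_3\eta_3(1+R_3)\left(q_3 + \dfrac{Y_3}{\eta_3} - (\mathfrak{E}_3.\mathfrak{F}_3)\right)$
$\gamma_3 = A_3\zeta_3(1+R_3)\left(q_3 + \dfrac{Z_3}{\zeta_3} - (\mathfrak{E}_3.\mathfrak{F}_3)\right)$
For control:   $s_3^2 = \alpha_3^2 + \beta_3^2 + \gamma_3^2 = A_3^2(1+R_3)^2 r_3^2$

Components of $\mathfrak{S}'''$.   (III‴)

$P''' = \dfrac{3R_3q_3}{(1+R_3)\,r_3^2}$
$\alpha''' = A_3\xi_3 + A_3\xi_3R_3 - P'''\alpha_3$
$\beta''' = A_3\eta_3 + A_3\eta_3R_3 - P'''\beta_3$
$\gamma''' = A_3\zeta_3 + A_3\zeta_3R_3 - P'''\gamma_3$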


The computer is now to assume any reasonable values either of the 
geocentric distances, $\rho_1, \rho_2, \rho_3$, or of the heliocentric distances, $r_1, r_2, 
r_3$ (the former in the case of a comet, the latter in the case of an 
asteroid), and from these assumed values to compute the rest of the 
following quantities: 

By equations III₁, III′.    By equations III₂, III″.    By equations III₃, III‴.

$q_1$                       $q_2$                       $q_3$
$\log r_1$                  $\log r_2$                  $\log r_3$
$\log R_1$                  $\log R_2$                  $\log R_3$
$\log(1+R_1)$               $\log(1-R_2)$               $\log(1+R_3)$
$\log P'$                   $\log P''$                  $\log P'''$
$\alpha_1$                  $\alpha_2$                  $\alpha_3$
$\beta_1$                   $\beta_2$                   $\beta_3$
$\gamma_1$                  $\gamma_2$                  $\gamma_3$
$\alpha'$                   $\alpha''$                  $\alpha'''$
$\beta'$                    $\beta''$                   $\beta'''$
$\gamma'$                   $\gamma''$                  $\gamma'''$
IV. 

Calculations relating to differential coefficients. 
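Components of $\mathfrak{S}''\times\mathfrak{S}'''$      Components of $\mathfrak{S}'''\times\mathfrak{S}'$      Components of $\mathfrak{S}'\times\mathfrak{S}''$
$a_1 = \beta''\gamma''' - \gamma''\beta'''$      $a_2 = \beta'''\gamma' - \gamma'''\beta'$      $a_3 = \beta'\gamma'' - \gamma'\beta''$
$b_1 = \gamma''\alpha''' - \alpha''\gamma'''$      $b_2 = \gamma'''\alpha' - \alpha'''\gamma'$      $b_3 = \gamma'\alpha'' - \alpha'\gamma''$
$c_1 = \alpha''\beta''' - \beta''\alpha'''$      $c_2 = \alpha'''\beta' - \beta'''\alpha'$      $c_3 = \alpha'\beta'' - \beta'\alpha''$

$G = (\mathfrak{S}'\mathfrak{S}''\mathfrak{S}''') = a_1\alpha' + b_1\beta' + c_1\gamma' = a_2\alpha'' + b_2\beta'' + c_2\gamma'' = a_3\alpha''' + b_3\beta''' + c_3\gamma'''$.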


These computations are controlled by the agreement of the three 
values of G. 
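The threefold control of $G$ is easily illustrated. The Python sketch below forms the three vector products and the three expressions for $G$; the helper names `cross`, `dot` and `differential_coefficients` are assumptions introduced for the illustration.

```python
def cross(u, v):
    """Vector product u x v of two 3-component sequences."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    """Scalar product of two 3-component sequences."""
    return sum(x * y for x, y in zip(u, v))

def differential_coefficients(Sp, Spp, Sppp):
    """Return (a1,b1,c1), (a2,b2,c2), (a3,b3,c3) and the three values of G.

    Sp, Spp, Sppp are the components of S', S'', S'''.
    """
    abc1 = cross(Spp, Sppp)   # S'' x S'''
    abc2 = cross(Sppp, Sp)    # S''' x S'
    abc3 = cross(Sp, Spp)     # S' x S''
    G_values = (dot(abc1, Sp), dot(abc2, Spp), dot(abc3, Sppp))
    return abc1, abc2, abc3, G_values
```

In exact arithmetic the three values of $G$ coincide, so that any sensible disagreement points to an error in the nine products.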
The following are not necessary except when the corrections to be 
made are large: 
$H = (\mathfrak{S}_2\mathfrak{S}'''\mathfrak{S}') = a_2\alpha_2 + b_2\beta_2 + c_2\gamma_2$


raN(asH) (0-088) BBs 
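V.


Corrections of the geocentric distances. 
Components of $\mathfrak{S}$. 

$\alpha = \alpha_1 + \alpha_2 + \alpha_3$,      $C_1 = -\dfrac{a_1\alpha + b_1\beta + c_1\gamma}{G}$

$\beta = \beta_1 + \beta_2 + \beta_3$,      $C_2 = -\dfrac{a_2\alpha + b_2\beta + c_2\gamma}{G}$

$\gamma = \gamma_1 + \gamma_2 + \gamma_3$,      $C_3 = -\dfrac{a_3\alpha + b_3\beta + c_3\gamma}{G}$


$\Delta q_2 = C_2 - \tfrac35 L(\Delta q_2)^2$. 


(This equation will generally be most easily solved by repeated 
substitutions.) 


$\Delta q_1 = C_1 - \tfrac35 L(\Delta q_2)^2$,      $\Delta q_3 = C_3 - \tfrac35 L(\Delta q_2)^2$. 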
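The "repeated substitutions" recommended for the quadratic in $\Delta q_2$ may be sketched as follows. The coefficient $\tfrac35 L$ is taken from the numerical example given later (where $-.01393(\Delta q_2)^2$ is used with $L = .02322$); the function name, tolerance and iteration limit are assumptions.

```python
def solve_dq2(C2, L, tol=1e-12, max_iter=100):
    """Solve dq = C2 - (3/5) L dq^2 by repeated substitution, starting from dq = C2."""
    dq = C2
    for _ in range(max_iter):
        new = C2 - 0.6 * L * dq * dq
        if abs(new - dq) < tol:
            return new
        dq = new
    raise RuntimeError("repeated substitution did not converge")

def corrections(C1, C2, C3, L):
    """Return (dq1, dq2, dq3); the same quadratic term is applied to C1 and C3."""
    dq2 = solve_dq2(C2, L)
    quad = C2 - dq2            # equals (3/5) L dq2^2
    return C1 - quad, dq2, C3 - quad
```

Because the quadratic term is small whenever $L$ is small, two or three substitutions usually suffice.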


VI.


Successive corrections. 


$\Delta q_1, \Delta q_2, \Delta q_3$ are to be added as corrections to $q_1, q_2, q_3$. With the 
new values thus obtained the computations by equations III₁, III₂, III₃ 
are to be recommenced. Two courses are now open: 

(a) The work may be carried on exactly as before to the determination 
of new corrections for $q_1, q_2, q_3$. 

(b) The computations by equations III′, III″, III‴, and IV may be 
omitted, and the old values of $a_1, b_1, c_1, a_2$, etc., $G$, and $L$ may be 
used with the new residuals $\alpha, \beta, \gamma$ to get new corrections for $q_1, q_2, 
q_3$ by the equations 

$\Delta q_2 = \dfrac{C_2}{1 + L(Dq_2 + \tfrac12 C_2)}$, 
$\Delta q_1 = C_1 + \Delta q_2 - C_2$,      $\Delta q_3 = C_3 + \Delta q_2 - C_2$, 


where $Dq_2$ denotes the former correction of $q_2$. (More generally, at 
any stage of the work, $Dq_2$ will represent the sum of all the corrections 
of $q_2$ which have been made since the last computation of $a_1, b_1$, etc.) 
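Course (b) amounts to a single division followed by two additions, as the following Python sketch shows. The function name and argument order are assumptions made for the illustration.

```python
def shortcut_corrections(C1, C2, C3, L, Dq2):
    """New corrections by course (b), re-using the old a, b, c, G and L.

    Dq2 is the sum of the corrections already applied to q2 since the
    coefficients a1, b1, etc. were last computed.
    """
    dq2 = C2 / (1 + L * (Dq2 + 0.5 * C2))
    shift = dq2 - C2          # the effect of the term containing L
    return C1 + shift, dq2, C3 + shift
```

With $C_1 = -.04567$, $C_2 = .004952$, $C_3 = -.04064$, $L = .02322$ and $Dq_2 = -.77826$, the sketch should return very nearly the corrections $-.04558$, $.005042$, $-.04055$ of the numerical example.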

So far as any general rule can be given, it is advised to recompute 
$a_1, b_1$, etc. and $G$ once, perhaps after the second corrections of $q_1, q_2, 
q_3$, unless the assumed values represent a fair approximation. Whether 
$L$ is also to be recomputed depends on its magnitude, and on that of 
the correction of $q_2$ which remains to be made. In the later stages 
of the work, when the corrections are small, the terms containing $L$ 
may be neglected altogether. 

The corrections of $q_1, q_2, q_3$ should be repeated until the equations 

$\alpha = 0$,   $\beta = 0$,   $\gamma = 0$ 

are nearly satisfied. Approximate values of $r_1, r_2, r_3$ may suffice for 
the following computations, which, however, must be made with the 
greatest exactness. 
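VII.


Test of the first hypothesis. 
$\log r_1, \log r_2, \log r_3$ (approximate values from the preceding 
computations). 

$N = A_1B_1r_1^{-3} + B_2r_2^{-3} + A_3B_3r_3^{-3}$
$s_1 = A_1r_1 + A_1B_1r_1^{-2}$
$s_2 = r_2 - B_2r_2^{-2}$
$s_3 = A_3r_3 + A_3B_3r_3^{-2}$
$s = \tfrac12(s_1 + s_2 + s_3)$
$s - s_1$,   $s - s_2$,   $s - s_3$. 

The value of $s - s_2$ may be very small, and its logarithm in consequence 
ill determined. This will do no harm if the computer is 
careful to use the same value (computed, of course, as carefully as 
possible) wherever the expression occurs in the following formulæ: 

$R = \sqrt{\dfrac{(s-s_1)(s-s_2)(s-s_3)}{s}}$,      $\tan\tfrac12(v_2 - v_1) = \dfrac{R}{s - s_3}$

$p = \dfrac{2(s - s_2)}{N}$,      $\tan\tfrac12(v_3 - v_2) = \dfrac{R}{s - s_1}$

$\tan\tfrac12(v_3 - v_1) = \dfrac{s - s_2}{R}$

For adjustment of values:   $\tfrac12(v_3 - v_1) = \tfrac12(v_2 - v_1) + \tfrac12(v_3 - v_2)$

$e\sin\tfrac12(v_3 + v_1) = \dfrac{\dfrac{p}{r_1} - \dfrac{p}{r_3}}{2\sin\tfrac12(v_3 - v_1)}$

$e\cos\tfrac12(v_3 + v_1) = \dfrac{\dfrac{p}{r_1} + \dfrac{p}{r_3} - 2}{2\cos\tfrac12(v_3 - v_1)}$

$\tan\tfrac12(v_3 + v_1)$,      $e$

For control:   $e\cos v_2 = \dfrac{p}{r_2} - 1$

$\epsilon = \sqrt{\dfrac{1-e}{1+e}}$,      $a = \dfrac{p}{1 - e^2}$

$\tan\tfrac12E_1 = \epsilon\tan\tfrac12v_1$,   $\tan\tfrac12E_2 = \epsilon\tan\tfrac12v_2$,   $\tan\tfrac12E_3 = \epsilon\tan\tfrac12v_3$

$\tau_{1\,\mathrm{calc.}} = a^{3/2}(E_3 - E_2) + e\,a^{3/2}\sin E_2 - e\,a^{3/2}\sin E_3$

$\tau_{3\,\mathrm{calc.}} = a^{3/2}(E_2 - E_1) + e\,a^{3/2}\sin E_1 - e\,a^{3/2}\sin E_2$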
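The whole test of § VII can be strung together as a short program. The Python sketch below follows the formulæ above step by step; the function name `check_hypothesis` and the returned dictionary are assumptions, and the angle $\tfrac12(v_3 - v_1)$ is taken as the sum of the two smaller half-angles, which corresponds to the "adjustment of values".

```python
import math

def check_hypothesis(A1, A3, B1, B2, B3, r1, r2, r3):
    """Return p, e, a and the calculated intervals tau1, tau3 from r1, r2, r3."""
    N = A1 * B1 / r1**3 + B2 / r2**3 + A3 * B3 / r3**3
    s1 = A1 * r1 + A1 * B1 / r1**2
    s2 = r2 - B2 / r2**2
    s3 = A3 * r3 + A3 * B3 / r3**2
    s = 0.5 * (s1 + s2 + s3)
    R = math.sqrt((s - s1) * (s - s2) * (s - s3) / s)
    p = 2 * (s - s2) / N
    half_21 = math.atan(R / (s - s3))        # (v2 - v1)/2
    half_32 = math.atan(R / (s - s1))        # (v3 - v2)/2
    half_31 = half_21 + half_32              # (v3 - v1)/2
    e_sin = (p / r1 - p / r3) / (2 * math.sin(half_31))
    e_cos = (p / r1 + p / r3 - 2) / (2 * math.cos(half_31))
    half_sum = math.atan2(e_sin, e_cos)      # (v3 + v1)/2
    e = math.hypot(e_sin, e_cos)
    a = p / (1 - e * e)
    eps = math.sqrt((1 - e) / (1 + e))
    v1 = half_sum - half_31
    v3 = half_sum + half_31
    v2 = v1 + 2 * half_21
    E1, E2, E3 = (2 * math.atan(eps * math.tan(v / 2)) for v in (v1, v2, v3))
    a32 = a ** 1.5
    tau1 = a32 * ((E3 - E2) - e * (math.sin(E3) - math.sin(E2)))
    tau3 = a32 * ((E2 - E1) - e * (math.sin(E2) - math.sin(E1)))
    return {"p": p, "e": e, "a": a, "tau1": tau1, "tau3": tau3}
```

The hypothesis is tested by comparing the logarithms of `tau1` and `tau3` with those of the given intervals.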


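VIII.

For the second hypothesis. 
$\delta\tau_1 = .0057613\,k(\rho_2 - \rho_3)$      (aberration-constant after Struve.) 
$\delta\tau_3 = .0057613\,k(\rho_1 - \rho_2)$      $\log(.0057613\,k) = 5.99610$
$\Delta\log\tau_1 = \log\tau_1 - \log(\tau_{1\,\mathrm{calc.}} - \delta\tau_1)$
$\Delta\log\tau_3 = \log\tau_3 - \log(\tau_{3\,\mathrm{calc.}} - \delta\tau_3)$
$\Delta\log(\tau_1\tau_3) = \Delta\log\tau_1 + \Delta\log\tau_3$
$\Delta\log\dfrac{\tau_1}{\tau_3} = \Delta\log\tau_1 - \Delta\log\tau_3$

$\Delta\log A_1 = A_3\,\Delta\log\dfrac{\tau_1}{\tau_3}$

$\Delta\log A_3 = -A_1\,\Delta\log\dfrac{\tau_1}{\tau_3}$

$\Delta\log B_1 = \Delta\log(\tau_1\tau_3) - \dfrac{\tau_1^2 + \tau_3^2}{12B_1}\,\Delta\log\dfrac{\tau_1}{\tau_3}$

$\Delta\log B_2 = \Delta\log(\tau_1\tau_3) + \dfrac{\tau_1^2 - \tau_3^2}{12B_2}\,\Delta\log\dfrac{\tau_1}{\tau_3}$

$\Delta\log B_3 = \Delta\log(\tau_1\tau_3) + \dfrac{\tau_1^2 + \tau_3^2}{12B_3}\,\Delta\log\dfrac{\tau_1}{\tau_3}$


These corrections are to be added to the logarithms of $A_1, A_3, 
B_1, B_2, B_3$, in equations III₁, III₂, III₃, and the corrected equations 
used to correct the values of $q_1, q_2, q_3$, until the residuals $\alpha, \beta, \gamma$ 
vanish. The new values of $A_1, A_3$ must satisfy the relation 
$A_1 + A_3 = 1$, and the corrections $\Delta\log A_1$, $\Delta\log A_3$ must be adjusted, 
if necessary, for this end. 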
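The corrections for the second hypothesis are simple enough to tabulate mechanically. The following Python sketch evaluates them from $\tau_1$, $\tau_3$ and the two differences $\Delta\log\tau_1$, $\Delta\log\tau_3$; it is an illustration only, and the function and key names are assumptions.

```python
def second_hypothesis_corrections(tau1, tau3, dlog_tau1, dlog_tau3):
    """Corrections to log A1, log A3, log B1, log B2, log B3."""
    A1 = tau1 / (tau1 + tau3)
    A3 = tau3 / (tau1 + tau3)
    B1 = (-tau1**2 + tau1 * tau3 + tau3**2) / 12
    B2 = (tau1**2 + 3 * tau1 * tau3 + tau3**2) / 12
    B3 = (tau1**2 + tau1 * tau3 - tau3**2) / 12
    d_prod = dlog_tau1 + dlog_tau3       # Delta log (tau1 tau3)
    d_ratio = dlog_tau1 - dlog_tau3      # Delta log (tau1 / tau3)
    sum_sq = tau1**2 + tau3**2
    return {
        "A1": A3 * d_ratio,
        "A3": -A1 * d_ratio,
        "B1": d_prod - sum_sq / (12 * B1) * d_ratio,
        "B2": d_prod + (tau1**2 - tau3**2) / (12 * B2) * d_ratio,
        "B3": d_prod + sum_sq / (12 * B3) * d_ratio,
    }
```

With $\log\tau_1 = 0.3358520$, $\log\tau_3 = 0.3624066$, $\Delta\log\tau_1 = -.0002365$ and $\Delta\log\tau_3 = -.0002416$, the sketch should agree to the seventh decimal with the corrections quoted in § VIII of the numerical example.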


Third hypothesis. 


A second correction of equations III₁, III₂, III₃ may be obtained in 
the same manner as the first, but this will rarely be necessary. 


IX.


Determination of the ellipse. 
It is supposed that the values of 
$\alpha_1, \beta_1, \gamma_1$,   $\alpha_2, \beta_2, \gamma_2$,   $\alpha_3, \beta_3, \gamma_3$, 
$r_1, r_2, r_3$,   $R_1, R_2, R_3$,   $s_1, s_2, s_3$, 
have been computed by equations III₁, III₂, III₃ with the greatest 
exactness, so as to make the residuals $\alpha, \beta, \gamma$ vanish, and that the 
two formulæ for each of the quantities $s_1, s_2, s_3$ give sensibly the same 
value. 
Components of S; 


Components of ©, 
__ 40g F BiBo+Yi¥2 ms 
Ba" : 


A= Ag — Ay as = 


a0, + B,B.+ 472 
B,=B3— By B;=Pa— 82 YAY B, 


ado + BiB. + yay 
Vien Y S aad. efiat es ae —- ie ‘ 25 


sP=aPt+Bet+ye 82° =as + Bs +y5° 
8=4$(8,+8)+85) S = }(8,+8,+8s) 


For control only : 
S(S—s,)(S—s,)(S—8s) 


oa (8 —8,)(8—85) 


pee SIS 8i)(S—s,)(S—ss) tan }(u,— 0) = —— 


8," — 8; 

R 
N=A,R,+R#,+A4,R, tan 3(Ug— Vs) = 
1 


2 R28 Rs 
P=Ne=s)6—%) tan $(%3—*)=G— 3 y(e—6,) 


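The computer should be careful to use the corrected values of $A_1, A_3$. 
(See VIII.) Trifling errors in the angles should be distributed. 

$e\sin\tfrac12(v_3 + v_1) = \dfrac{\dfrac{p}{r_1} - \dfrac{p}{r_3}}{2\sin\tfrac12(v_3 - v_1)}$

$e\cos\tfrac12(v_3 + v_1) = \dfrac{\dfrac{p}{r_1} + \dfrac{p}{r_3} - 2}{2\cos\tfrac12(v_3 - v_1)}$

$\tan\tfrac12(v_3 + v_1)$,      $e$

For control:   $e\cos v_2 = \dfrac{p}{r_2} - 1$

$\epsilon = \sqrt{\dfrac{1-e}{1+e}}$,      $a = \dfrac{p}{1 - e^2}$,      $b = \sqrt{ap}$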


Direction-cosines of semi-major axis. 


COS UV, sin v, 
l=— = 5 Shs 
89 gt 
C08 sin v 
bal Ve NV, 
1 ee Bs 
COS VU, sin Vv, 
de cae oe V5 
8 8. 


ty Bie ela ea Re Ses 
Direction-cosines of semi-minor axis. 
sin v cos Vv 
A == : a+ 2 as 
8 8 
2 5 


cos u 
By+ * Bs 


89 85 


_  sinv, 


SIM VU, cos V9 


85 Y2 S5 ae 


v= — 


Components of the semi-axes. 


$a_x = al$,      $a_y = am$,      $a_z = an$
$b_x = b\lambda$,      $b_y = b\mu$,      $b_z = b\nu$

X. 


Time of perihelion passage. 


Corrections for aberration. 


$\tan\tfrac12E_1 = \epsilon\tan\tfrac12v_1$,      $\delta t_1 = -.0057613\,\rho_1$
$\tan\tfrac12E_2 = \epsilon\tan\tfrac12v_2$,      $\delta t_2 = -.0057613\,\rho_2$
$\tan\tfrac12E_3 = \epsilon\tan\tfrac12v_3$,      $\delta t_3 = -.0057613\,\rho_3$

$\log .0057613 = 7.76052$

$t_1 + \delta t_1 - T = k^{-1}a^{3/2}(E_1 - e\sin E_1)$
$t_2 + \delta t_2 - T = k^{-1}a^{3/2}(E_2 - e\sin E_2)$
$t_3 + \delta t_3 - T = k^{-1}a^{3/2}(E_3 - e\sin E_3)$

The threefold determination of $T$ affords a control of the exactness 
of the solution of the problem. If the discrepancies in the values of $T$ 
are such as to require another correction of the formulæ (a third 
hypothesis), this may be based on the equations 

$\Delta\log\tau_1 = M\,\dfrac{T_{(3)} - T_{(2)}}{t_3 - t_2}$,      $\Delta\log\tau_3 = M\,\dfrac{T_{(2)} - T_{(1)}}{t_2 - t_1}$,

where $T_{(1)}, T_{(2)}, T_{(3)}$ denote respectively the values obtained from the 
first, second, and third observations, and $M$ the modulus of common 
logarithms. 
XI. 


For an ephemeris. 

$\dfrac{k(t - T)}{a^{3/2}} = E - e\sin E$


Heliocentric coordinates. (Components of $\mathfrak{R}$.) 
$x = -e\,a_x + a_x\cos E + b_x\sin E$
$y = -e\,a_y + a_y\cos E + b_y\sin E$
$z = -e\,a_z + a_z\cos E + b_z\sin E$
These equations are completely controlled by the agreement of the 
computed and observed positions and the following relations between 
the constants: 
$a_xb_x + a_yb_y + a_zb_z = 0$,      $a_x^2 + a_y^2 + a_z^2 = a^2$,      $b_x^2 + b_y^2 + b_z^2 = (1 - e^2)a^2$
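For an ephemeris the excentric anomaly must be found from the time, which requires the solution of Kepler's equation. The Python sketch below uses Newton's method for that purpose (a choice made here for the illustration, not a method prescribed in the text) and then forms the heliocentric coordinates from the components of the semi-axes. All names are assumptions.

```python
import math

def solve_kepler(mean_anomaly, e, tol=1e-14, max_iter=50):
    """Solve E - e sin E = mean_anomaly (radians) by Newton's method."""
    E = mean_anomaly
    for _ in range(max_iter):
        step = (E - e * math.sin(E) - mean_anomaly) / (1 - e * math.cos(E))
        E -= step
        if abs(step) < tol:
            break
    return E

def heliocentric(E, e, a_vec, b_vec):
    """x, y, z = -e a + a cos E + b sin E, component by component.

    a_vec = (a_x, a_y, a_z) and b_vec = (b_x, b_y, b_z) are the components
    of the semi-major and semi-minor axes.
    """
    return tuple(-e * a + a * math.cos(E) + b * math.sin(E)
                 for a, b in zip(a_vec, b_vec))

def ephemeris_position(t, T, k, a, e, a_vec, b_vec):
    """Heliocentric coordinates at time t (days), perihelion passage at T."""
    mean_anomaly = k * (t - T) / a ** 1.5
    return heliocentric(solve_kepler(mean_anomaly, e), e, a_vec, b_vec)
```

At $E = 0$ the body is at perihelion and the coordinates reduce to $(1 - e)$ times the components of the semi-major axis, which affords a simple check of the signs.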
XII. 


When an approximate orbit is known in advance, we may use it to 
improve our fundamental equation. The following appears to be the 
most simple method: 

Find the excentric anomalies $E_1, E_2, E_3$, and the heliocentric 
distances $r_1, r_2, r_3$, which belong in the approximate orbit to the times 
of observation corrected for aberration. 

Calculate $B_1, B_3$, as in § I, using these corrected times. 

Determine $A_1, A_3$ by the equation 

$\dfrac{A_1\left(1 + \dfrac{B_1}{r_1^3}\right)}{\sin(E_3 - E_2) - e\sin E_3 + e\sin E_2} = \dfrac{A_3\left(1 + \dfrac{B_3}{r_3^3}\right)}{\sin(E_2 - E_1) - e\sin E_2 + e\sin E_1}$

in connection with the relation $A_1 + A_3 = 1$. 
Determine $B_2$ so as to make 


Bi Be Bs 
A, ie = m —+4 Ass 
4sin }(#,—£,) an 4 eon =a Ay a 3(#,—£)) 
equal to either member of the last equation. 

It is not necessary that the times for which $E_1, E_2, E_3, r_1, r_2, r_3$ 
are calculated should precisely agree with the times of observation 
corrected for aberration. Let the former be represented by $t_1', t_2', t_3'$, 
and the latter by $t_1'', t_2'', t_3''$; and let 

$\Delta\log\tau_1 = \log(t_3'' - t_2'') - \log(t_3' - t_2')$, 
$\Delta\log\tau_3 = \log(t_2'' - t_1'') - \log(t_2' - t_1')$. 

We may find $B_1, B_3, A_1, A_3, B_2$, as above, using $t_1', t_2', t_3'$, and then use 
$\Delta\log\tau_1$, $\Delta\log\tau_3$ to correct their values, as in § VIII. 


Numerical Example. 


To illustrate the numerical computations we have chosen the 
following example, both on account of the large heliocentric motion, 
and because Gauss and Oppolzer have treated the same data by their 
different methods. 

The data are taken from the Theoria Motus, §159, viz., 


Times, 1805, September      |      5.51336      |    139.42711      |    265.39813
Longitudes of Ceres         |  95° 32′ 18″.56   |  99° 49′  5″.87   | 118°  5′ 28″.85
Latitudes of Ceres          |  -0° 59′ 34″.06   |  +7° 16′ 36″.80   |  +7° 38′ 49″.39
Longitudes of the Earth     | 342° 54′ 56″.00   | 117° 12′ 43″.25   | 241° 58′ 50″.71
Logs of the Sun's distance  |    0.0031514      |    9.9929861      |    0.0056974


The positions of Ceres have been freed from the effects of parallax 
and aberration. 


I. 
From the given times we obtain the following values: 

                |   Numbers.    | Logarithms.
$t_2 - t_1$     |  133.91375    |  2.1268252
$t_3 - t_2$     |  125.97102    |  2.1002706
$t_3 - t_1$     |  259.88477    |  2.4147809
$A_1$           |   .4847187    |  9.6854897
$A_3$           |   .5152812    |  9.7120443
$\tau_1$        |               |  0.3358520
$\tau_3$        |               |  0.3624066
$B_1$           |               |  9.6692113
$B_2$           |               |  0.3183722
$B_3$           |               |  9.5623916

Control: 
$A_1B_1 + B_2 + A_3B_3 = 2.4959086$
$\tfrac12\tau_1\tau_3 = 2.4959081$

II. 


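From the given positions we get: 

$\log X_1$     | 9.9835515 | + || $\log X_2$     | 9.6531725 | - || $\log X_3$     | 9.6775810 | -
$\log Y_1$     | 9.4711748 | - || $\log Y_2$     | 9.9420444 | + || $\log Y_3$     | 9.9515547 | -
$Z_1$          | 0         |   || $Z_2$          | 0         |   || $Z_3$          | 0         |
$\log \xi_1$   | 8.9845270 | - || $\log \xi_2$   | 9.2282738 | - || $\log \xi_3$   | 9.6690294 | -
$\log \eta_1$  | 9.9979027 | + || $\log \eta_2$  | 9.9900800 | + || $\log \eta_3$  | 9.9416855 | +
$\log \zeta_1$ | 8.2387150 | - || $\log \zeta_2$ | 9.1026549 | + || $\log \zeta_3$ | 9.1240813 | +
$(\mathfrak{E}_1.\mathfrak{F}_1)$ | .3874081 | - || $(\mathfrak{E}_2.\mathfrak{F}_2)$ | .9314223 | + || $(\mathfrak{E}_3.\mathfrak{F}_3)$ | .5599304 | -
$p_1^2$        | .8645336  | + || $p_2^2$        | .1006681  | + || $p_3^2$        | .7130624  | +

III. 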

The preceding computations furnish the numerical values for the 
equations III₁, III′, III₂, III″, III₃, III‴, which follow. Brackets 
indicate that logarithms have been substituted for numbers. 

We have now to assume some values for the heliocentric distances 
$r_1, r_2, r_3$. A mean proportional between the mean distances of Mars 
and Jupiter from the Sun suggests itself as a reasonable assumption. 
In order, however, to test the convergence of the computations, when 
the assumptions are not happy, we will make the much less probable 
assumption (actually much farther from the truth) that the heliocentric 
distances are an arithmetical mean between the distances of 
Mars and Jupiter. This gives .526 for the logarithm of each of the 
distances $r_1, r_2, r_3$. From these assumed values we compute the first 
column of numbers in the three following tables: 
III₁. 
$q_1 = \rho_1 - .3874081$
$r_1^2 = q_1^2 + .8645336$
$R_1 = [9.6692113]\,r_1^{-3}$
$P' = \dfrac{[.47712]\,R_1q_1}{(1+R_1)\,r_1^2}$
$\alpha_1 = -[8.6700167](q_1 - 9.5901555)(1+R_1)$
$\beta_1 = +[9.6833924](q_1 + .0900552)(1+R_1)$
$\gamma_1 = -[7.9242047](q_1 + .3874081)(1+R_1)$

III′. 
$\alpha' = -.046775 - [8.67002]R_1 - P'\alpha_1$
$\beta' = +.482383 + [9.68339]R_1 - P'\beta_1$
$\gamma' = -.008399 - [7.92420]R_1 - P'\gamma_1$

$\Delta q_1$     |   |          | -.66731  | -.04558   | +.0010434 | +.0000006
$q_1$            | + | 3.22606  | 2.55875  | 2.51317   | 2.5142134 | 2.5142140
$\log r_1$       | + |  .52600  |  .434960 |  .4280791 |  .4282376 |  .4282377
$\log R_1$       | + | 8.09121  | 8.364331 | 8.3849740 | 8.3844985 |
$\log(1+R_1)$    | + |  .00533  |  .009934 |  .0104122 |  .0104010 |
$\log P'$        | + | 8.01967  | 8.369626 | 8.3957468 | 8.3951457 |
$\alpha_1$       | + |  .30136  |  .336506 |  .3390605 |  .3390018 |
$\beta_1$        | + | 1.61938  | 1.307304 | 1.286223  | 1.2867056 |
$\gamma_1$       | - |  .03072  |  .025316 |  .0249518 |  .0249601 |
$\alpha'$        | - |  .050505 |          |  .0563438 |           |
$\beta'$         | + |  .47139  |          |  .4620942 |           |
$\gamma'$        | - |  .00818  |          |  .0079821 |           |


III₂. 
$q_2 = \rho_2 + .9314223$
$r_2^2 = q_2^2 + .1006681$
$R_2 = [0.3183722]\,r_2^{-3}$
$P'' = \dfrac{[.47712]\,R_2q_2}{(1-R_2)\,r_2^2}$
$\alpha_2 = +[9.2282738](q_2 + 1.7286820)(1-R_2)$
$\beta_2 = -[9.9900800](q_2 - .0361309)(1-R_2)$
$\gamma_2 = -[9.1026549](q_2 - .9314223)(1-R_2)$

III″. 
$\alpha'' = +.169151 - [9.22827]R_2 + P''\alpha_2$
$\beta'' = -.977417 + [9.99008]R_2 + P''\beta_2$
$\gamma'' = -.126664 + [9.10265]R_2 + P''\gamma_2$

$\Delta q_2$     |   |          | -.77826  | +.005042  | +.0013222 | +.0000021
$q_2$            | + | 3.34235  | 2.56409  | 2.569132  | 2.5704542 | 2.5704563
$\log r_2$       | + |  .52600  |  .412233 |  .4130733 |  .4132934 |  .4132937
$\log R_2$       | + | 8.74037  | 9.081673 | 9.0791524 | 9.0784920 |
$\log(1-R_2)$    | + | 9.97543  | 9.944142 | 9.9444866 | 9.9445766 |
$\log P''$       | + | 8.71411  | 9.199120 | 9.1954270 | 9.1944598 |
$\alpha_2$       | + |  .81059  |  .638489 |  .6397466 |  .6400760 |
$\beta_2$        | - | 3.05379  | 2.172660 | 2.1787230 | 2.1803116 |
$\gamma_2$       | - |  .28858  |  .181843 |  .1825486 |  .1827338 |
$\alpha''$       | + |  .20182  |          |  .2491854 |           |
$\beta''$        | - | 1.08177  |          | 1.2018221 |           |
$\gamma''$       | - |  .13464  |          |  .1400944 |           |


III₃. 
$q_3 = \rho_3 - .5599304$
$r_3^2 = q_3^2 + .7130624$
$R_3 = [9.5623916]\,r_3^{-3}$
$P''' = \dfrac{[.47712]\,R_3q_3}{(1+R_3)\,r_3^2}$
$\alpha_3 = -[9.3810737](q_3 + 1.5798163)(1+R_3)$
$\beta_3 = +[9.6537308](q_3 - .4630521)(1+R_3)$
$\gamma_3 = +[8.8361256](q_3 + .5599304)(1+R_3)$

III‴. 
$\alpha''' = -.240477 - [9.38107]R_3 - P'''\alpha_3$
$\beta''' = +.450537 + [9.65373]R_3 - P'''\beta_3$
$\gamma''' = +.068569 + [8.83613]R_3 - P'''\gamma_3$

$\Delta q_3$     |   |          | -.80780  | -.04055   | +.0025316 | +.0000031
$q_3$            | + | 3.24945  | 2.44165  | 2.40110   | 2.4036316 | 2.4036347
$\log r_3$       | + |  .52600  |  .412217 |  .4057319 |  .4061394 |  .4061399
$\log R_3$       | + | 7.98439  | 8.325742 | 8.3451948 | 8.3439733 |
$\log(1+R_3)$    | + |  .00417  |  .009099 |  .0095108 |  .0094843 |
$\log P'''$      | + | 7.91715  | 8.357016 | 8.3817516 | 8.3801993 |
$\alpha_3$       | - | 1.17253  |  .987590 |  .9785152 |  .9790776 |
$\beta_3$        | + | 1.26749  |  .910305 |  .8924956 |  .8936069 |
$\gamma_3$       | + |  .26373  |  .210171 |  .2075292 |  .2076940 |
$\alpha'''$      | - |  .22847  |          |  .2222335 |           |
$\beta'''$       | + |  .44441  |          |  .4390163 |           |
$\gamma'''$      | + |  .06690  |          |  .0650888 |           |


IV. 


The values of $\alpha', \beta'$, etc., furnish the basis for the computation of 
the following quantities: 

$a_1 = -.01254$      $a_2 = -.03517$      $a_3 = -.07232$
$b_1 = +.01726$      $b_2 = -.00525$      $b_3 = -.00845$
$c_1 = -.15746$      $c_2 = -.08526$      $c_3 = -.04050$
For $G$ we get three values sensibly identical. Adopting the mean, 
we set 
$G = .01006$. 
We also get $H = -.00998$, $L = .02322$.* 

V. 


Taking the values of $\alpha_1, \alpha_2$, etc., from the columns under III₁, III₂, 
III₃, we form the residuals 

$\alpha = -.06058$,   $\beta = -.16692$,   $\gamma = -.05557$. 
From these, with the numbers last computed, we get 
$C_1 = -.65888$,   $C_2 = -.76983$,   $C_3 = -.79939$, 

which might be used as corrections for our values of $q_1, q_2, q_3$. To 
get more accurate values for these corrections we set 
$\Delta q_2 = C_2 - \tfrac35L(\Delta q_2)^2$,   or   $\Delta q_2 = -.76983 - .01393(\Delta q_2)^2$, 
which gives $\Delta q_2 = -.77826$. 
The quadratic term diminishes the value of $\Delta q_2$ by .00843. Subtracting 
the same quantity from $C_1$ and $C_3$ we get 

$\Delta q_1 = -.66731$,   $\Delta q_3 = -.80780$. 


* It would have been better to omit altogether the calculation of $H$ and $L$, if the 
small value of the latter could have been foreseen. In fact, it will be found that 
the terms containing $L$ hardly improve the convergence, being smaller than quantities 
which have been neglected. Nevertheless, the use of these terms in this example will 
illustrate a process which in other cases may be beneficial. 
Applying these corrections to the values of $q_1, q_2, q_3$, we compute 
the second numerical columns under equations III₁, III₂, and III₃. 
We do not go on to the computations by equations III′, etc., but 
content ourselves with the old values of $a_1, b_1$, etc., $G$, and $L$, which 
with the new residuals 

$\alpha = -.012595$,   $\beta = .044949$,   $\gamma = .003012$, 

give $C_1 = -.04567$,   $C_2 = .004952$,   $C_3 = -.04064$. 
$\Delta q_2 = C_2 - L(Dq_2 + \tfrac12C_2)\,\Delta q_2 = .004952 - .02322(-.77826 + .00247)\,\Delta q_2$. 
This gives $\Delta q_2 = .005042$. 

As the term containing $L$ has increased the value of $\Delta q_2$ by .00009, 
we add this quantity to $C_1$ and $C_3$, and get 
$\Delta q_1 = -.04558$,   $\Delta q_3 = -.04055$. 

With these corrections we compute the third numerical columns 
under equations III₁, etc. This time we recompute the quantities $\alpha'$, 
etc., with which we repeat the principal computations of IV, and get 
the new values 

$a_1 = -.0167215$      $a_2 = -.0335815$      $a_3 = -.0743299$
$b_1 = +.0149145$      $b_2 = -.0054413$      $b_3 = -.0098825$
$c_1 = -.1576886$      $c_2 = -.0779570$      $c_3 = -.0474318$
$G = .0090929$. 
The quantities $H$ and $L$ we neglect as of no consequence at this 
stage of the approximation. 
With these values the new residuals 
$\alpha = +.0002919$,   $\beta = -.0000044$,   $\gamma = +.0000288$, 
give   $\Delta q_1 = C_1 = +.0010434$,   $\Delta q_2 = C_2 = +.0013222$, 
$\Delta q_3 = C_3 = +.0025316$. 
These corrections furnish the basis for the fourth columns of 
numbers under equations III₁, etc., which give the residuals 
$\alpha = +.0000002$,   $\beta = +.0000009$,   $\gamma = +.0000001$, 
and the new corrections 
$\Delta q_1 = +.0000006$,   $\Delta q_2 = +.0000021$,   $\Delta q_3 = +.0000031$. 
The corrected values of $q_1, q_2, q_3$ give 
$\log r_1 = 0.4282377$,   $\log r_2 = 0.4132937$,   $\log r_3 = 0.4061399$. 

We have carried the approximation farther than is necessary for 
the following correction of the formula, in order to see exactly where 
the uncorrected formula would lead us, and for the control afforded 
by the fourth residuals. 
The computations for the test of the uncorrected formula (the first 
hypothesis) are as follows: 

                               |   | Number or arc.     | Logarithm.
$r_1$                          |   |                    | 0.4282377
$r_2$                          |   |                    | 0.4132937
$r_3$                          |   |                    | 0.4061399
$A_1B_1r_1^{-3}$               | + |  .01174865         | 8.0699879
$B_2r_2^{-3}$                  | + |  .11980944         | 9.0784911
$A_3B_3r_3^{-3}$               | + |  .01137670         | 8.0560162
$N$                            | + |  .14293479         | 9.1551380
$s_1$                          | + | 1.3308476          | 0.1241283
$s_2$                          | + | 2.2796616          | 0.3578704
$s_3$                          | + | 1.3417404          | 0.1276685
$s$                            | + | 2.4761248          | 0.3937725
$s - s_1$                      | + | 1.1452772          | 0.0589106
$s - s_2$                      | + |  .1964632          | 9.2932812
$s - s_3$                      | + | 1.1343844          | 0.0547602
$R$                            | + |                    | 9.5065898
$p$                            | + |                    | 0.4391732
$\tan\tfrac12(v_2 - v_1)$      | + |  15° 48′ 10″.82    | 9.4518296
$\tan\tfrac12(v_3 - v_2)$      | + |  15° 39′ 36″.38    | 9.4476792
$\tan\tfrac12(v_3 - v_1)$      | + |  31° 27′ 47″.20    | 9.7866915
$e\sin\tfrac12(v_3 + v_1)$     | - |                    | 8.7099387
$e\cos\tfrac12(v_3 + v_1)$     | + |                    | 8.7872701
$\tan\tfrac12(v_3 + v_1)$      | - | -39° 55′ 32″.31    | 9.9226686
$e$                            | + |                    | 8.9025438
$\epsilon$                     | + |                    | 9.9652259
$a$                            | + |                    | 0.4419546
$\tan\tfrac12v_1$              | - | -35° 41′ 39″.75    | 9.8563809
$\tan\tfrac12v_2$              | - | -19° 53′ 28″.93    | 9.5584981
$\tan\tfrac12v_3$              | - |  -4° 13′ 52″.55    | 8.8691380
$\tan\tfrac12E_1$              | - | -33° 33′  0″.17    | 9.8216068
$\tan\tfrac12E_2$              | - | -18° 28′  6″.35    | 9.5237240
$\tan\tfrac12E_3$              | - |  -3° 54′ 24″.21    | 8.8343639
$\sin E_1$                     | - | -67°  6′  0″.34    | 9.9643473
$\sin E_2$                     | - | -36° 56′ 12″.70    | 9.7788272
$\sin E_3$                     | - |  -7° 48′ 48″.42    | 9.1333734
$e\,a^{3/2}\sin E_1$           | - |  .3387061          | 9.5298230
$e\,a^{3/2}\sin E_2$           | - |  .2209545          | 9.3443029
$e\,a^{3/2}\sin E_3$           | - |  .0499861          | 8.6988491
$a^{3/2}(E_2 - E_1)$           | + | 2.4226307          | 0.3842872
$a^{3/2}(E_3 - E_2)$           | + | 2.3391145          | 0.3690515
$\tau_{3\,\mathrm{calc.}}$     | + | 2.3048791          | 0.3626482
$\tau_{1\,\mathrm{calc.}}$     | + | 2.1681461          | 0.3360885
VIII. 


The logarithms of the calculated values of the intervals of time 
exceed those of the given values by .0002416 for the first interval ($\tau_3$) 
and .0002365 for the second ($\tau_1$). Therefore, since the corrections for 
aberration have been incorporated in the data, we set for the correction 
of the formula (for the second hypothesis) 

$\Delta\log\tau_1 = -.0002365$      $\Delta\log\tau_3 = -.0002416$
This gives   $\Delta\log A_1 = .0000026$      $\Delta\log A_3 = -.0000025$
$\Delta\log B_1 = -.0004872$      $\Delta\log B_2 = -.0004782$      $\Delta\log B_3 = -.0004665$
The new values of the logarithms of $A_1, A_3$ are 
$\log A_1 = 9.6854923$      $\log A_3 = 9.7120418$
Applying these corrections to equations III₁, III₂, III₃,* we get the 
following: 

III₁ corrected. 
$r_1^2 = q_1^2 + .8645336$
$R_1 = [9.6687241]\,r_1^{-3}$
$\alpha_1 = -[8.6700193](q_1 - 9.5901555)(1+R_1)$
$\beta_1 = +[9.6833950](q_1 + .0900552)(1+R_1)$
$\gamma_1 = -[7.9242073](q_1 + .3874081)(1+R_1)$

$\Delta q_1$     |   |           | +.0002887 | -.0000217
$q_1$            | + | 2.5142140 | 2.5145027 | 2.5144810
$\log r_1$       | + |  .4282377 |  .4282816 |  .4282782
$\log R_1$       | + | 8.3838110 | 8.3838793 | 8.3838894
$\log(1+R_1)$    | + |  .0103847 |  .0103863 |  .0103865
$\alpha_1$       | + |  .3389910 |  .3389784 |  .3389796
$\beta_1$        | + | 1.2866654 | 1.2868124 | 1.2868024
$\gamma_1$       | - |  .0249593 |  .0249619 |  .0249617
$\log s_1$       | + |           |           |  .1241571

III₂ corrected. 
$r_2^2 = q_2^2 + .1006681$
$R_2 = [0.3178940]\,r_2^{-3}$
$\alpha_2 = +[9.2282738](q_2 + 1.7286820)(1-R_2)$
$\beta_2 = -[9.9900800](q_2 - .0361309)(1-R_2)$
$\gamma_2 = -[9.1026549](q_2 - .9314223)(1-R_2)$

$\Delta q_2$     |   |           | -.0000955 | +.0000187
$q_2$            | + | 2.5704563 | 2.5703608 | 2.5703795
$\log r_2$       | + |  .4132937 |  .4132778 |  .4132809
$\log R_2$       | + | 9.0780129 | 9.0780605 | 9.0780513
$\log(1-R_2)$    | + | 9.9446418 | 9.9446353 | 9.9446365
$\alpha_2$       | + |  .6401725 |  .6401487 |  .6401532
$\beta_2$        | - | 2.1806412 | 2.1805261 | 2.1805482
$\gamma_2$       | - |  .1827615 |  .1827481 |  .1827507
$\log s_2$       | + |           |           |  .3579174

III₃ corrected. 
$r_3^2 = q_3^2 + .7130624$
$R_3 = [9.5619251]\,r_3^{-3}$
$\alpha_3 = -[9.3810712](q_3 + 1.5798163)(1+R_3)$
$\beta_3 = +[9.6537283](q_3 - .4630521)(1+R_3)$
$\gamma_3 = +[8.8361231](q_3 + .5599304)(1+R_3)$

$\Delta q_3$     |   |           | +.0003302 | +.0000424
$q_3$            | + | 2.4036347 | 2.4039649 | 2.4040073
$\log r_3$       | + |  .4061399 |  .4061929 |  .4061998
$\log R_3$       | + | 8.3435055 | 8.3433463 | 8.3433257
$\log(1+R_3)$    | + |  .0094742 |  .0094708 |  .0094704
$\alpha_3$       | - |  .9790500 |  .9791236 |  .9791329
$\beta_3$        | + |  .8935824 |  .8937277 |  .8937461
$\gamma_3$       | + |  .2076882 |  .2077097 |  .2077124
$\log s_3$       | + |           |           |  .1277120


* The corrections may be made without rewriting the equations. 
With these corrected equations the last values of $q_1, q_2, q_3$ give 
the residuals 

$\alpha = .0001135$      $\beta = -.0003934$      $\gamma = -.0000326$
These give the corrections 
$\Delta q_1 = .0002887$      $\Delta q_2 = -.0000955$      $\Delta q_3 = .0003302$

The next residuals are 

$\alpha = .0000035$      $\beta = .0000140$      $\gamma = -.0000003$
which give the corrections 
$\Delta q_1 = -.0000217$      $\Delta q_2 = .0000187$      $\Delta q_3 = .0000424$
The next residuals are 
$\alpha = -.0000001$      $\beta = .0000003$      $\gamma = .0000000$

which must be regarded as entirely insensible. 


x 


It remains to determine the ellipse which passes through the points 
to which the numbers relate in the last columns under the corrected 
equations III₁, III₂, III₃, and also the time of perihelion passage. 
The computations are as follows: 
This gives the following equations for an ephemeris: 
$T$ = 1806, June 23.97450, Paris mean time 
$[2.8863186](t - T) = E_{\mathrm{in\ seconds}} - [4.2216270]\sin E$
Heliocentric coordinates relating to the ecliptic. 
$x = +.1820700 - [0.3530366]\cos E - [0.1827457]\sin E$
$y = -.1244685 + [0.1878576]\cos E - [0.3603257]\sin E$
$z = -.0373970 + [9.6656346]\cos E + [9.3320292]\sin E$
The differences of the values of $T_{(1)}, T_{(2)}, T_{(3)}$ from their mean $T$ 
indicate the residual errors of this hypothesis. They indicate differences 
in the calculated and the observed geocentric positions which 
are represented by the geocentric angles subtended by the path 
described by the planet in the following fractions of a day: .00054, 
.00003, .00052. Since the heliocentric motion of the planet is about 
one-fourth of a degree per day, and the planet is considerably farther 
from the earth than from the sun at the times of the first and third 
observations, the errors will be less than half a second in arc. 
If we desire all the accuracy possible with seven-figure logarithms, 
we may form a third hypothesis based on the following corrections: 

$\Delta\log\tau_1 = M\,\dfrac{T_{(3)} - T_{(2)}}{t_3 - t_2} = -.0000017$, 

$\Delta\log\tau_3 = M\,\dfrac{T_{(2)} - T_{(1)}}{t_2 - t_1} = -.0000018$. 

The equations for an ephemeris will then be: 
$T$ = 1806, June 23.96378, Paris mean time 
$[2.8863140](t - T) = E_{\mathrm{in\ seconds}} - [4.2216530]\sin E$
Heliocentric coordinates relating to the ecliptic. 

$x = +.1820765 - [0.3530261]\cos E - [0.1827783]\sin E$
$y = -.1244853 + [0.1878904]\cos E - [0.3603153]\sin E$
$z = -.0373987 + [9.6656285]\cos E + [9.3320758]\sin E$

The agreement of the calculated geocentric positions with the data 
is shown in the following table: 

Times, 1805, September   |      5.51336     |    139.42711     |    265.39813
Second hypothesis: 
  longitudes             |  95° 32′ 18″.88  |  99° 49′  5″.87  | 118°  5′ 28″.52
  errors                 |       0″.32      |       0″.00      |      -0″.33
  latitudes              |  -0° 59′ 34″.01  |   7° 16′ 36″.82  |   7° 38′ 49″.34
  errors                 |       0″.05      |       0″.02      |      -0″.05
Third hypothesis: 
  longitudes             |  95° 32′ 18″.65  |  99° 49′  5″.82  | 118°  5′ 28″.79
  errors                 |       0″.09      |      -0″.05      |      -0″.06
  latitudes              |  -0° 59′ 34″.04  |   7° 16′ 36″.78  |   7° 38′ 49″.38
  errors                 |       0″.02      |      -0″.02      |      -0″.01
The immediate result of each hypothesis is to give three positions of 
the planet, from which, with the times, the orbit may be calculated in 
various ways, and with different results, so far as the positions deviate 
from the truth on account of the approximate nature of the hypothesis. 
In some respects, therefore, the correctness of an hypothesis is best 
shown by the values of the geocentric or heliocentric distances which 
are derived directly from it. The logarithms of the heliocentric 
distances are brought together in the following table, and corresponding 
values from Gauss* and Oppolzer† are added for comparison. It is 
worthy of notice that the positions given by our second hypothesis are 
substantially correct, and if the orbit had been calculated from the 
first and third of these positions with the interval of time, it would 
have left little to be desired. 

                        | $\log r_1$ | $\log r_2$ | $\log r_3$
First hypothesis        |  .4282377  |  .4132937  |  .4061399
Second hypothesis       |  .4282782  |  .4132809  |  .4061998
Third hypothesis        |  .4282786  |  .4132808  |  .4062003
Gauss: 
  First hypothesis      |  .4323934  |  .4114726  |  .4094712
  Second hypothesis     |  .4291773  |  .4129371  |  .4071975
  Third hypothesis      |  .4284841  |  .4132107  |  .4064697
  Fourth hypothesis     |  .4282792  |  .4132817  |  .4062033
Oppolzer: 
  First hypothesis      |  .4281340  |  .413330   |  .4061699
  Second hypothesis     |  .4282794  |  .4132801  |  .4061976
  Third hypothesis      |  .4282787  |            |  .4062009

In comparing the different methods, it should be observed that 
the determination of the positions in any hypothesis by Gauss's 
method requires successive corrections of a single independent variable, 
a corresponding determination by Oppolzer's method requires the 
successive corrections of two independent variables, while the corresponding 
determination by the method of the present paper requires 
the successive corrections of three independent variables. 


* Theoria motus, § 159. 
† Lehrbuch zur Bahnbestimmung der Kometen und Planeten, 2nd ed., vol. i, p. 394. 


VI. 


ON THE USE OF THE VECTOR METHOD IN THE 
DETERMINATION OF ORBITS 


LETTER TO DR. HUGO BUCHHOLZ, EDITOR OF KLINKERFUES' Theoretische 
Astronomie.* 


New Haven, October, 1898. 

Dr. HUGO BUCHHOLZ, 

My dear Sir, The opinion of Fabritius† on the comparative convenience 
of different methods is entitled to far more weight than 
mine, for I am no astronomer, and have calculated very few orbits, 
none, indeed, except for the trial of my own formulæ. The object of 
my paper was to show to astronomers, who are rather conservative 
(and with right, for astronomy is the oldest of the exact sciences), 
the advantage in the use of vector notations, which I had learned in 
Physics from Maxwell. This object could be best obtained, not by 
showing, as I might have done, that much in the classic methods 
could be conveniently and perspicuously represented by vector 
notations, but rather by showing that these notations so simplify 
the subject, that it is easy to construct a method for the complete 
solution of the problem. That the method given is the best possible, 
I certainly do not claim, but only that it is much better than I could 
have found without the use of vector notations. Some of the more 
obvious crudities in my paper have been corrected in that of Beebe and 
Phillips.‡ Doubtless many more remain, even if the general method 
be preserved. 

My first efforts, however, to solve the fundamental approximative 
equation were along the same lines which Fabritius has followed: 
to set $r_1 = r_2$ and $r_3 = r_2$ in equation second of (2) of Fabritius, which 
will give $\rho_2$ and $r_2$, then to get $r_1$ from the first of (3) of Fabritius, 
and then $r_3$ either from equation second of (3) or from some other 


* [In which the preceding memoir, and also that of Beebe and Phillips referred to 
below, are translated.] 

† [See Fabritius, W. "Ueber eine leichte Methode der Bahnbestimmung mit 
Zugrundelegung des Princips von Gibbs," also "Weitere Anwendungen des Gibbs'schen 
Princips," Astronomische Nachrichten, No. 3061, 3065 (1891).] 

‡ ["The Orbit of Swift's comet, 1880 V, Determined by Gibbs's Vector Method," 
W. Beebe and A. W. Phillips. Gould's Astronomical Journal, vol. ix, Dec. 1889.] 
which would serve the purpose, and then to find better values of 
py , by setting in equation second of (2) 


lis T. 2? 3 > 2? 


the expressions in brackets denoting numbers derived from the 
approximate values already found. This is similar to or identical with 
the method of Fabritius, except that he combines with it the principle 
of interpolation (for the first value in the third “hypothesis”). As 
I found the approximation by this method sometimes slow or failing, 
notably in the case of Swift’s comet, 1880 V, I tried the method 
published in my paper. Indeed, it may be said that the method of 
my paper was constructed to meet the exigencies of the case of the 
comet, 1880 V. 

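In ordinary cases I think that the method of Fabritius may very 
likely be better than that which I published. The equations are very 
simply and perspicuously represented in vector notations. I shall use 
the notations of my paper, writing E, F, etc., for German letters.* To 
eliminate $\rho_1$ and $\rho_3$ from equation (7) in my paper, multiply directly 
by $\mathfrak{F}_1\times\mathfrak{F}_3$. This gives 

$A_1\left(1 + \dfrac{B_1}{r_1^3}\right)(\mathfrak{E}_1\mathfrak{F}_1\mathfrak{F}_3) - \left(1 - \dfrac{B_2}{r_2^3}\right)\left[(\mathfrak{E}_2\mathfrak{F}_1\mathfrak{F}_3) + \rho_2(\mathfrak{F}_2\mathfrak{F}_1\mathfrak{F}_3)\right] + A_3\left(1 + \dfrac{B_3}{r_3^3}\right)(\mathfrak{E}_3\mathfrak{F}_1\mathfrak{F}_3) = 0$.      (a)

To eliminate $\rho_3$ and $r_3$, multiply by $\mathfrak{E}_3\times\mathfrak{F}_3$, which gives 

$A_1\left(1 + \dfrac{B_1}{r_1^3}\right)\left[(\mathfrak{E}_1\mathfrak{E}_3\mathfrak{F}_3) + \rho_1(\mathfrak{F}_1\mathfrak{E}_3\mathfrak{F}_3)\right] - \left(1 - \dfrac{B_2}{r_2^3}\right)\left[(\mathfrak{E}_2\mathfrak{E}_3\mathfrak{F}_3) + \rho_2(\mathfrak{F}_2\mathfrak{E}_3\mathfrak{F}_3)\right] = 0$.      (b)

When we have found $\rho_1, r_1, \rho_2, r_2$, it is not necessary to eliminate 
any of them, and to save labor in forming the equation for $\rho_3, r_3$, I 
should be inclined to take the components in (7) in the direction of 
one of the coordinate axes, choosing that one which is most nearly 
directed towards the third observed position. However, I will write 

$A_1\left(1 + \dfrac{B_1}{r_1^3}\right)\left[(\mathfrak{E}_1.\mathfrak{S}) + \rho_1(\mathfrak{F}_1.\mathfrak{S})\right] - \left(1 - \dfrac{B_2}{r_2^3}\right)\left[(\mathfrak{E}_2.\mathfrak{S}) + \rho_2(\mathfrak{F}_2.\mathfrak{S})\right] + A_3\left(1 + \dfrac{B_3}{r_3^3}\right)\left[(\mathfrak{E}_3.\mathfrak{S}) + \rho_3(\mathfrak{F}_3.\mathfrak{S})\right] = 0$,      (c)

where $\mathfrak{S}$ may represent an axis of coordinates, or $(\mathfrak{E}_1\times\mathfrak{F}_1)$, which 
would give Fabritius' equation. It might be directed towards the pole 
of the ecliptic, which would make $(\mathfrak{E}_1.\mathfrak{S})$, $(\mathfrak{E}_2.\mathfrak{S})$, $(\mathfrak{E}_3.\mathfrak{S})$ vanish, 
except for exceedingly minute quantities depending on the latitude of 
the sun and the geocentric coordinates of the observatories, if these 
are included in $\mathfrak{E}_1, \mathfrak{E}_2, \mathfrak{E}_3$. 


* [In the remainder of the letter as here printed German capitals have been substituted 
for the E, F, etc. of the original, thus making the notation uniform with that of 
the paper referred to.] 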

The equations (a), (6), (c), which are together equivalent to (7), I 
would solve as follows, almost in the same way as Fabritius, but 
relying a little more on interpolation, and less on the convergence of 
which he speaks, which in special cases may more or less fail. 

Setting $r_1=r_2$ and $r_3=r_2$ in (a), which thus modified I shall call (a′),
and solving this (a′) by “trial and error,” using $\rho_2$ as the independent
variable, as soon as I have a value of $\rho_2$ which I think will give a
residual of (a′) of the same order of smallness as the effect of changing
$\frac{1}{r_1^3}$ and $\frac{1}{r_3^3}$ into $\frac{1}{r_2^3}$, I determine from this value by (b) and (c), $r_1$ and
$r_3$, and then find the residual of (a), using the values of $r_1$, $r_2$, $r_3$
derived all from the same assumed $\rho_2$. Now using the last value of
$\frac{\Delta(\text{residual})}{\Delta\rho_2}$ in my previous calculations on (a′), which indeed applies
only roughly to the (a), I would get a value $\rho_2$ which I would use for
the second “hypothesis” in (a). This will give a second residual in
(a), which will enable me to make a more satisfactory interpolation.
As many more interpolations may be made as shall be found necessary.

Some such method, which should perhaps be called the method of
Fabritius, would, I think, in most cases probably be the best for
solution of equation (7).

Of course I am quite aware that the merit of my paper, if any, lies
principally in the fundamental approximation (1). I will add a few
words on this subject.

The equation may be written more symmetrically

$$\tau_1\Big(1+\frac{\tau_2^2+\tau_3^2-3\tau_1^2}{24\,r_1^3}\Big)\mathfrak{R}_1-\tau_2\Big(1+\frac{\tau_1^2+\tau_3^2-3\tau_2^2}{24\,r_2^3}\Big)\mathfrak{R}_2+\tau_3\Big(1+\frac{\tau_1^2+\tau_2^2-3\tau_3^2}{24\,r_3^3}\Big)\mathfrak{R}_3=0. \tag{I}$$

It might be made entirely symmetrical by writing $-\tau_2$ for $\tau_2$.

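If an expression ending with $t^3$ had been used, we could still have
satisfied two of the conditions relating to acceleration, and should
have obtained

$$\tau_1\mathfrak{R}_1-\tau_2\Big(1+\frac{\tau_1^2-\tau_2^2}{6\,r_2^3}\Big)\mathfrak{R}_2+\tau_3\Big(1+\frac{\tau_1^2-\tau_3^2}{6\,r_3^3}\Big)\mathfrak{R}_3=0, \tag{IIa}$$

or

$$\tau_1\Big(1+\frac{\tau_2^2-\tau_1^2}{6\,r_1^3}\Big)\mathfrak{R}_1-\tau_2\mathfrak{R}_2+\tau_3\Big(1+\frac{\tau_2^2-\tau_3^2}{6\,r_3^3}\Big)\mathfrak{R}_3=0, \tag{IIb}$$

or

$$\tau_1\Big(1+\frac{\tau_3^2-\tau_1^2}{6\,r_1^3}\Big)\mathfrak{R}_1-\tau_2\Big(1+\frac{\tau_3^2-\tau_2^2}{6\,r_2^3}\Big)\mathfrak{R}_2+\tau_3\mathfrak{R}_3=0. \tag{IIc}$$

Using an expansion ending with $t^2$ we can only satisfy one con-
dition relating to acceleration, say the second. This will give

$$\tau_1\mathfrak{R}_1-\tau_2\Big(1-\frac{\tau_1\tau_3}{2\,r_2^3}\Big)\mathfrak{R}_2+\tau_3\mathfrak{R}_3=0. \tag{III}$$

(Gauss uses virtually

$$\tau_1\mathfrak{R}_1-\frac{\tau_2\,\mathfrak{R}_2}{1+\dfrac{\tau_1\tau_3}{2\,r_2^3}}+\tau_3\mathfrak{R}_3=0,$$

which is a little more convenient, but not, I think, generally quite
so accurate.)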

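Writing an equation analogous to III for the earth and subtracting
from (7), Mem. Nat. Acad., we have

$$\tau_1\rho_1\mathfrak{F}_1-\tau_2\Big(1-\frac{\tau_1\tau_3}{2\,r_2^3}\Big)\rho_2\mathfrak{F}_2+\tau_3\rho_3\mathfrak{F}_3=\frac{\tau_1\tau_2\tau_3}{2}\Big(\frac{1}{|\mathfrak{E}_2|^3}-\frac{1}{r_2^3}\Big)\mathfrak{E}_2,$$

which gives, on multiplication by $\mathfrak{F}_1\times\mathfrak{F}_3$ and $\mathfrak{F}_2\times\mathfrak{E}_2$, theorems of
Olbers and Lambert.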

It is evident that in general the error in I is of the fifth order, in
IIa, IIb, IIc of the fourth, and in III of the third. But for equal
intervals, the error in I is of the sixth order, and in III of the fourth.
And when $\tau_2^2+\tau_3^2-3\tau_1^2=0$, IIa becomes identical with I, and its
error is of the fifth order.

The same is true of IIc in the corresponding case. It follows that
when the intervals are nearly as 5:8 we should use IIa or IIc instead
of I. This will evidently abbreviate the solution given above as
only one of the quantities $r_1$, $r_3$ is to be used.
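
The ratio 5:8 can be checked from the condition under which IIa coincides with I. The short worked step below assumes, as an aid to the reader and not as part of the letter, that $\tau_2$ is the whole interval, so that $\tau_2=\tau_1+\tau_3$:

```latex
\begin{aligned}
\tau_2^2+\tau_3^2-3\tau_1^2
  &= (\tau_1+\tau_3)^2+\tau_3^2-3\tau_1^2
   = 2\,(\tau_3^2+\tau_1\tau_3-\tau_1^2)=0,\\
\frac{\tau_3}{\tau_1} &= \frac{\sqrt{5}-1}{2}\approx 0.618,
\qquad\text{while}\qquad \frac{5}{8}=0.625 .
\end{aligned}
```

The corresponding case for IIc follows by interchanging $\tau_1$ and $\tau_3$.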

The formulæ IIa, IIb, IIc may also be obtained by the following
method, which will show their relative accuracy.

The interpolation formula

$$\tau_1\frac{\mathfrak{R}_1}{r_1^3}-\tau_2\frac{\mathfrak{R}_2}{r_2^3}+\tau_3\frac{\mathfrak{R}_3}{r_3^3}=0$$

has an error evidently of the second order. If we multiply by
$\frac{\tau_2^2+\tau_3^2-3\tau_1^2}{24}$ and subtract from I, we get IIa. So if we multiply by

$$\frac{\tau_1^2+\tau_3^2-3\tau_2^2}{24}\qquad\text{or}\qquad\frac{\tau_1^2+\tau_2^2-3\tau_3^2}{24}$$

we get IIb or IIc. The errors due to using one of these equations
instead of I are therefore proportional to these multipliers and very
unequal.
Again, in case of equal intervals, IIa and IIc become identical with
III. There is, therefore, no reason for using IIa or IIc when the
intervals are nearly equal. IIb is in this case much less accurate
than III.

It will be observed that all the formulæ I, IIa, IIb, IIc, III, may be
expressed in the general form

$$A_1\Big(1+\frac{B_1}{r_1^3}\Big)\mathfrak{R}_1-\Big(1-\frac{B_2}{r_2^3}\Big)\mathfrak{R}_2+A_3\Big(1+\frac{B_3}{r_3^3}\Big)\mathfrak{R}_3=0$$

except that the letters $B_1$, $B_2$, $B_3$ have different values in the different
cases, some vanishing in the more simple formulæ. Moreover, if the
values of $B_1$, $B_2$, $B_3$ have been calculated for I, the values for IIa, IIb,
or IIc are found simply by subtraction of one of the numbers from the
three. It is evident that IIb will hardly be useful except in special
cases, as in the determination of a parabolic orbit in the failing case of
Olbers’ method, and then it would be a question whether it would not
be better to determine the orbit from $\rho_2$ and $\rho_3$, or $\rho_1$ and $\rho_2$, using IIa
or IIc.
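
The relative accuracy of the five formulæ can be tried numerically. The sketch below is only an illustration under stated assumptions: the test orbit (semi-axis 1, eccentricity 0.3, units in which the attraction constant is 1), the function names, and the reading of $\tau_1$, $\tau_2$, $\tau_3$ as the intervals $t_3-t_2$, $t_3-t_1$, $t_2-t_1$ are all supplied here and are not taken from the letter. The numbers attached to $1/r^3$ are those of I, and those of IIa, IIb, IIc are formed from them by subtracting one of the three from all, as described above.

```python
import math

def kepler_position(t, a=1.0, e=0.3):
    """Position at time t on a planar two-body orbit with mu = 1.

    The orbit (a = 1, e = 0.3, perihelion at t = 0) is an arbitrary test case.
    """
    mean_anomaly = a ** -1.5 * t
    E = mean_anomaly
    for _ in range(50):  # Newton's method on Kepler's equation
        E -= (E - e * math.sin(E) - mean_anomaly) / (1.0 - e * math.cos(E))
    return (a * (math.cos(E) - e), a * math.sqrt(1.0 - e * e) * math.sin(E), 0.0)

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def inverse_cube_numbers(form, tau1, tau2, tau3):
    """Numbers multiplying 1/r^3 in the terms of R1, R2, R3 (signs included)."""
    full = [(tau2**2 + tau3**2 - 3 * tau1**2) / 24,
            (tau1**2 + tau3**2 - 3 * tau2**2) / 24,
            (tau1**2 + tau2**2 - 3 * tau3**2) / 24]
    if form == "I":
        return full
    if form == "III":
        return [0.0, -tau1 * tau3 / 2, 0.0]
    dropped = {"IIa": 0, "IIb": 1, "IIc": 2}[form]
    # IIa, IIb, IIc: subtract one of the three numbers from all three.
    return [c - full[dropped] for c in full]

def residual(form, t1, t2, t3):
    """Length of the left-hand member of the chosen formula, divided by tau2."""
    tau1, tau2, tau3 = t3 - t2, t3 - t1, t2 - t1
    numbers = inverse_cube_numbers(form, tau1, tau2, tau3)
    weights = (tau1 / tau2, -1.0, tau3 / tau2)
    total = [0.0, 0.0, 0.0]
    for w, c, t in zip(weights, numbers, (t1, t2, t3)):
        R = kepler_position(t)
        factor = w * (1.0 + c / norm(R) ** 3)
        total = [s + factor * x for s, x in zip(total, R)]
    return norm(total)

# Unequal intervals (0.04 and 0.06): print the residual of each formula.
errors = {f: residual(f, 0.0, 0.04, 0.10) for f in ("I", "IIa", "IIb", "IIc", "III")}
for name, value in errors.items():
    print(f"{name:>3}: {value:.3e}")
```

With equal intervals, `residual("IIa", ...)` and `residual("III", ...)` agree to rounding, which is the identity of IIa with III noted above.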

Equations IIa and IIc are very appropriate for the determination
of an elliptic orbit when the observed motion is nearly in the ecliptic, 
by means of four observations with intervals nearly in the ratio 
5:8:5. 

It is evident that the solution of (7) given above may be varied, in 
ways too numerous to mention, by the use of the simpler forms IIa,
IIc, or III for I in the earlier stages of the work. This only involves
changing the values of $B_1$, $B_2$, $B_3$, in (a), (b) and (c).

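It is not correct to say that in my expressions for the ratios of the
triangles the error is of the fifth order in general, or for equal intervals,
of the sixth. If we write $p_1$, $p_2$, $p_3$ for the coefficients of $\mathfrak{R}_1$, $\mathfrak{R}_2$, $\mathfrak{R}_3$
in I, and $\mathfrak{X}$ for the error of the equation, we have exactly

$$p_1\mathfrak{R}_1-p_2\mathfrak{R}_2+p_3\mathfrak{R}_3=\mathfrak{X},$$

which gives

$$p_1\,\mathfrak{R}_1\times\mathfrak{R}_3-p_2\,\mathfrak{R}_2\times\mathfrak{R}_3=\mathfrak{X}\times\mathfrak{R}_3,$$

$$\frac{\mathfrak{R}_2\times\mathfrak{R}_3}{\mathfrak{R}_1\times\mathfrak{R}_3}=\frac{p_1}{p_2}-\frac{\mathfrak{X}\times\mathfrak{R}_3}{p_2\,\mathfrak{R}_1\times\mathfrak{R}_3}.$$

Now $\frac{p_1}{p_2}$ is my expression for the ratio of the triangles, and
$\frac{\mathfrak{X}\times\mathfrak{R}_3}{p_2\,\mathfrak{R}_1\times\mathfrak{R}_3}$ is its error. This is of the fourth order in general (since the denomin-
ator is of the first), and for equal intervals, of the fifth. The same is
true of the two other ratios. Thus we have

$$\frac{\mathfrak{R}_1\times\mathfrak{R}_2}{\mathfrak{R}_1\times\mathfrak{R}_3}=\frac{p_3}{p_2}-\frac{\mathfrak{R}_1\times\mathfrak{X}}{p_2\,\mathfrak{R}_1\times\mathfrak{R}_3}.$$

Adding these equations and subtracting 1 [from both sides] we have

$$\frac{\mathfrak{R}_2\times\mathfrak{R}_3+\mathfrak{R}_1\times\mathfrak{R}_2-\mathfrak{R}_1\times\mathfrak{R}_3}{\mathfrak{R}_1\times\mathfrak{R}_3}=\frac{p_1+p_3-p_2}{p_2}+\frac{(\mathfrak{R}_3-\mathfrak{R}_1)\times\mathfrak{X}}{p_2\,\mathfrak{R}_1\times\mathfrak{R}_3}.$$

Here the last term, which represents the error, is of the fifth
order in general, or for equal intervals, of the sixth. But the
quantity sought is of the second order, and the relative error is of
the third order in the general case, or the fourth for equal intervals.
It is precisely this error which is most important in the case of
elliptic orbits.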

It will be observed that the accuracy of the expressions for the 
ratios $[r_1r_2]:[r_2r_3]:[r_1r_3]$ affords no measure of the accuracy of the
formula for the determination of elliptic orbits. 

I think that this hasty sketch will illustrate the convenience and 
perspicuity of vector notations in this subject, quite independently of 
any particular method which is chosen for the determination of the 
orbit. What is the best method? is hardly, I think, a question which 
admits of a definite reply. It certainly depends upon the ratio of the 
time intervals, their absolute value, and many other things. 


Yours very truly, 
J. WILLARD GIBBS. 


P.S. If we wish to use the curtate distances, with reference to the
ecliptic or the equator, let $\rho_1$ be defined as the distance multiplied by
cosine (lat. or dec.), and $\mathfrak{F}_1$ as a vector of length secant (lat. or dec.).
For the most part the formulæ will require no change, but the square
of $\mathfrak{F}_1$ will be $\sec^2$(lat. or dec.) instead of unity, so that the last terms
of (8) will have this factor. $(\mathfrak{F}_1\mathfrak{F}_2\mathfrak{F}_3)$ will then be Gauss’ (0.1.2.),
whereas in my paper $(\mathfrak{F}_1\mathfrak{F}_2\mathfrak{F}_3)$ is Lagrange’s (C′C″C‴).

J. W. G.


VII.


ON THE ROLE OF QUATERNIONS IN THE ALGEBRA 
OF VECTORS. 


[Nature, vol. XLIII. pp. 511-513, April 2, 1891.]


THE following passage, which has recently come to my notice, in 
the preface to the third edition of Prof. Tait’s Quaternions seems 
to call for some reply : 

“Even Prof. Willard Gibbs must be ranked as one of the retarders 
of quaternion progress, in virtue of his pamphlet on Vector Analysis, 
a sort of hermaphrodite monster, compounded of the notations of 
Hamilton and of Grassmann.” 

The merits or demerits of a pamphlet printed for private distri- 
bution a good many years ago do not constitute a subject of any great 
importance, but the assumptions implied in the sentence quoted are 
suggestive of certain reflections and inquiries which are of broader 
interest, and seem not untimely at a period when the methods and 
results of the various forms of multiple algebra are attracting so much 
attention. It seems to be assumed that a departure from quaternionic 
usage in the treatment of vectors is an enormity. If this assumption 
is true, it is an important truth; if not, it would be unfortunate if it 
should remain unchallenged, especially when supported by so high an 
authority. The criticism relates particularly to notations, but I 
believe that there is a deeper question of notions underlying that of 
notations. Indeed, if my offence had been solely in the matter of 
notation, it would have been less accurate to describe my production 
as a monstrosity, than to characterize its dress as uncouth. 

Now what are the fundamental notions which are germane to a 
vector analysis? (A vector analysis is of course an algebra for 
vectors, or something which shall be to vectors what ordinary algebra 
is to ordinary quantities.) If we pass over those notions which are so 
simple that they go without saying, geometrical addition (denoted 
by +) is, perhaps, first to be mentioned. Then comes the product of 
the lengths of two vectors and the cosine of the angle which they 
include. This, taken negatively, is denoted in quaternions by $S\alpha\beta$,
where $\alpha$ and $\beta$ are the vectors. Equally important is a vector
at right angles to $\alpha$ and $\beta$ (on a specified side of their plane), and
representing in length the product of their lengths and the sine of
the angle which they include. This is denoted by $V\alpha\beta$ in quaternions.
How these notions are represented in my pamphlet is a question of
very subordinate consequence, which need not be considered at
present. The importance of these notions, and the importance of a
suitable notation for them, is not, I suppose, a matter on which there
is any difference of opinion. Another function of $\alpha$ and $\beta$, called
their product and written $\alpha\beta$, is used in quaternions. In the general
case, this is neither a vector, like $V\alpha\beta$, nor a scalar (or ordinary
algebraic quantity), like $S\alpha\beta$, but a quaternion; that is, it is part
vector and part scalar. It may be defined by the equation

$$\alpha\beta=V\alpha\beta+S\alpha\beta.$$
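
The defining equation can be checked on numbers. The following is a minimal sketch, assuming the usual Hamilton multiplication table and storing a quaternion as a tuple (scalar, x, y, z); the function names and the two sample vectors are illustrative only. It shows that the product of two vectors has for its scalar part the negative of the product of lengths and cosine, and for its vector part the vector at right angles to both.

```python
def quat_mul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (pw * qw - px * qx - py * qy - pz * qz,
            pw * qx + px * qw + py * qz - pz * qy,
            pw * qy - px * qz + py * qw + pz * qx,
            pw * qz + px * qy - py * qx + pz * qw)

def dot(a, b):
    """Product of the lengths and the cosine of the included angle."""
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    """Vector at right angles to a and b, of length |a||b| sin(angle)."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

alpha = (1.0, 2.0, 3.0)
beta = (-2.0, 0.5, 4.0)

# Vectors enter the quaternion product as quaternions with zero scalar part.
product = quat_mul((0.0, *alpha), (0.0, *beta))
S_alpha_beta = product[0]    # scalar part: the negative of dot(alpha, beta)
V_alpha_beta = product[1:]   # vector part: cross(alpha, beta)
print(S_alpha_beta, V_alpha_beta)
```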


The question arises, whether the quaternionic product can claim a
prominent and fundamental place in a system of vector analysis. It
certainly does not hold any such place among the fundamental
geometrical conceptions as the geometrical sum, the scalar product,
or the vector product. The geometrical sum $\alpha+\beta$ represents the
third side of a triangle as determined by the sides $\alpha$ and $\beta$. $V\alpha\beta$
represents in magnitude the area of the parallelogram determined by
the sides $\alpha$ and $\beta$, and in direction the normal to the plane of the
parallelogram. $S\gamma V\alpha\beta$ represents the volume of the parallelopiped
determined by the edges $\alpha$, $\beta$, and $\gamma$. These conceptions are the very
foundations of geometry. 

We may arrive at the same conclusion from a somewhat narrower 
but very practical point of view. It will hardly be denied that 
sines and cosines play the leading parts in trigonometry. Now the 
notations $V\alpha\beta$ and $S\alpha\beta$ represent the sine and the cosine of the angle
included between $\alpha$ and $\beta$, combined in each case with certain other
simple notions. But the sine and cosine combined with these 
auxiliary notions are incomparably more amenable to analytical 
transformation than the simple sine and cosine of trigonometry, 
exactly as numerical quantities combined (as in algebra) with the 
notion of positive or negative quality are incomparably more amenable 
to analytical transformation than the simple numerical quantities of 
arithmetic. 

I do not know of anything which can be urged in favor of the 
quaternionic product of two vectors as a fundamental notion in 
vector analysis, which does not appear trivial or artificial in com- 
parison with the above considerations. The same is true of the 
quaternionic quotient, and of the quaternion in general. 

How much more deeply rooted in the nature of things are the
functions $S\alpha\beta$ and $V\alpha\beta$ than any which depend on the definition
of a quaternion, will appear in a strong light if we try to extend
our formulæ to space of four or more dimensions. It will not be
claimed that the notions of quaternions will apply to such a space,
except indeed in such a limited and artificial manner as to rob them
of their value as a system of geometrical algebra. But vectors exist
in such a space, and there must be a vector analysis for such a space.
The notions of geometrical addition and the scalar product are
evidently applicable to such a space. As we cannot define the
direction of a vector in space of four or more dimensions by the
condition of perpendicularity to two given vectors, the definition of $V\alpha\beta$,
as given above, will not apply totidem verbis to space of four or more
dimensions. But a little change in the definition, which would make
no essential difference in three dimensions, would enable us to apply
the idea at once to space of any number of dimensions.

These considerations are of a somewhat a priori nature. It may be 
more convincing to consider the use actually made of the quaternion 
as an instrument for the expression of spatial relations. The principal 
use seems to be the derivation of the functions expressed by $S\alpha\beta$ and
$V\alpha\beta$. Each of these expressions is regarded by quaternionic writers
as representing two distinct operations; first, the formation of the
product $\alpha\beta$, which is the quaternion, and then the taking out of this
quaternion the scalar or the vector part, as the case may be, this 
second process being represented by the selective symbol, S or V. 
This is, I suppose, the natural development of the subject in a treatise 
on quaternions, where the chosen subject seems to require that we 
should commence with the idea of a quaternion, or get there as soon 
as possible, and then develop everything from that particular point of 
view. In a system of vector analysis, in which the principle of de- 
velopment is not thus predetermined, it seems to me contrary to good 
method that the more simple and elementary notions should be defined 
by means of those which are less so. 

The quaternion affords a convenient notation for rotations. The 
notation $q(\ )q^{-1}$, where $q$ is a quaternion and the operand is to be
written in the parenthesis, produces on all possible vectors just such 
changes as a (finite) rotation of a solid body. Rotations may also be 
represented, in a manner which seems to leave nothing to be desired, 
by linear vector functions. Doubtless each method has advantages in 
certain cases, or for certain purposes. But since nothing is more 
simple than the definition of a linear vector function, while the 
definition of a quaternion is far from simple, and since in any case 
linear vector functions must be treated in a system of vector analysis, 
capacity for representing rotations does not seem to me sufficient to 
entitle the quaternion to a place among the fundamental and necessary 
notions of a vector analysis. 
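
The two representations of a rotation may be compared on an example. The sketch below is illustrative only: it assumes a unit axis and the usual half-angle form of the rotating quaternion, and it builds the linear vector function as a 3×3 matrix by Rodrigues' formula, a choice made here for brevity and not taken from the text. All names are hypothetical.

```python
import math

def quat_mul(p, q):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return (pw * qw - px * qx - py * qy - pz * qz,
            pw * qx + px * qw + py * qz - pz * qy,
            pw * qy - px * qz + py * qw + pz * qx,
            pw * qz + px * qy - py * qx + pz * qw)

def rotate_by_quaternion(v, axis, angle):
    """Apply q( )q^-1 to the vector v, with q = cos(angle/2) + axis*sin(angle/2)."""
    s = math.sin(angle / 2.0)
    q = (math.cos(angle / 2.0), axis[0] * s, axis[1] * s, axis[2] * s)
    q_inv = (q[0], -q[1], -q[2], -q[3])  # inverse of a unit quaternion
    _, x, y, z = quat_mul(quat_mul(q, (0.0, *v)), q_inv)
    return (x, y, z)

def rotation_matrix(axis, angle):
    """The same rotation as a linear vector function (Rodrigues' formula)."""
    c, s = math.cos(angle), math.sin(angle)
    n = axis
    K = ((0.0, -n[2], n[1]), (n[2], 0.0, -n[0]), (-n[1], n[0], 0.0))
    return [[c * (i == j) + s * K[i][j] + (1.0 - c) * n[i] * n[j]
             for j in range(3)] for i in range(3)]

def apply(matrix, v):
    return tuple(sum(matrix[i][j] * v[j] for j in range(3)) for i in range(3))

axis = (1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0)  # a unit vector
angle = 0.7
v = (0.3, -1.2, 2.0)

by_quaternion = rotate_by_quaternion(v, axis, angle)
by_linear_function = apply(rotation_matrix(axis, angle), v)
print(by_quaternion)
print(by_linear_function)
```

The two printed triples should agree to rounding, which is all the comparison is meant to show.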

Another use of the quaternionic idea is associated with the symbol $\nabla$.
The quantities written $S\nabla\omega$ and $V\nabla\omega$, where $\omega$ denotes a vector having
values which vary in space, are of fundamental importance in physics.
In quaternions these are derived from the quaternion $\nabla\omega$ by selecting
respectively the scalar or the vector part. But the most simple and
elementary definitions of $S\nabla\omega$ and $V\nabla\omega$ are quite independent of the
conception of a quaternion, and the quaternion $\nabla\omega$ is scarcely used
except in combination with the symbols S and V, expressed or
implied. There are a few formulæ in which there is a trifling gain in
compactness in the use of the quaternion, but the gain is very trifling
so far as I have observed, and generally, it seems to me, at the expense
of perspicuity.

These considerations are sufficient, I think, to show that the position 
of the quaternionist is not the only one from which the subject of 
vector analysis may be viewed, and that a method which would be 
monstrous from one point of view, may be normal and inevitable from 
another. 

Let us now pass to the subject of notations. I do not know 
wherein the notations of my pamphlet have any special resemblance 
to Grassmann’s, although the point of view from which the pamphlet 
was written is certainly much nearer to his than to Hamilton’s. But 
this is a matter of minor consequence. It is more important to ask,
What are the requisites of a good notation for the purposes of vector 
analysis? There is no difference of opinion about the representation 
of geometrical addition. When we come to functions having an 
analogy to multiplication, the product of the lengths of two vectors
and the cosine of the angle which they include, from any point of
view except that of the quaternionist, seems more simple than the
same quantity taken negatively. Therefore we want a notation for
what is expressed by $-S\alpha\beta$, rather than $S\alpha\beta$, in quaternions. Shall
the symbol denoting this function be a letter or some other sign?
and shall it precede the vectors or be placed between them? A little 
reflection will show, I think, that while we must often have recourse 
to letters to supplement the number of signs available for the ex- 
pression of all kinds of operations, it is better that the symbols 
expressing the most fundamental and frequently recurring operations 
should not be letters, and that a sign between the vectors, and, as it 
were, uniting them, is better than a sign before them in a case having 
a formal analogy with multiplication. The case may be compared 
with that of addition, for which $\alpha+\beta$ is evidently more convenient
than $\Sigma(\alpha,\beta)$ or $\Sigma\alpha\beta$ would be. Similar considerations will apply to
the function written in quaternions $V\alpha\beta$. It would seem that we
obtain the ne plus ultra of simplicity and convenience, if we express
the two functions by uniting the vectors in each case with a sign
suggestive of multiplication. The particular form of the signs which
we adopt is a matter of minor consequence. In order to keep within
the resources of an ordinary printing office, I have used a dot and a 
cross, which are already associated with multiplication, but are not 
needed for ordinary multiplication, which is best denoted by the 
simple juxtaposition of the factors. I have no especial predilection 
for these particular signs. The use of the dot is indeed liable to the 
objection that it interferes with its use as a separatrix, or instead of a 
parenthesis. 

If, then, I have written $\alpha.\beta$ and $\alpha\times\beta$ for what is expressed in
quaternions by $-S\alpha\beta$ and $V\alpha\beta$, and in like manner $\nabla.\omega$ and $\nabla\times\omega$
for $-S\nabla\omega$ and $V\nabla\omega$ in quaternions, it is because the natural
development of a vector analysis seemed to lead logically to some such
notations. But I think that I can show that these notations have 
some substantial advantages over the quaternionic in point of con- 
venience. 

Any linear vector function of a variable vector $\rho$ may be expressed
in the form

$$\alpha\lambda.\rho+\beta\mu.\rho+\gamma\nu.\rho=(\alpha\lambda+\beta\mu+\gamma\nu).\rho=\Phi.\rho,$$

where

$$\Phi=\alpha\lambda+\beta\mu+\gamma\nu;$$

or in quaternions

$$-\alpha S\lambda\rho-\beta S\mu\rho-\gamma S\nu\rho=-(\alpha S\lambda+\beta S\mu+\gamma S\nu)\rho=-\phi\rho,$$

where

$$\phi=\alpha S\lambda+\beta S\mu+\gamma S\nu.$$

If we take the scalar product of the vector $\Phi.\rho$, and another vector
$\sigma$, we obtain the scalar quantity

$$\sigma.\Phi.\rho=\sigma.(\alpha\lambda+\beta\mu+\gamma\nu).\rho,$$

or in quaternions

$$S\sigma\phi\rho=S\sigma(\alpha S\lambda+\beta S\mu+\gamma S\nu)\rho.$$

This is a function of $\sigma$ and of $\rho$, and it is exactly the same kind of
function of $\sigma$ that it is of $\rho$, a symmetry which is not so clearly
exhibited in the quaternionic notation as in the other. Moreover, we
can write $\sigma.\Phi$ for $\sigma.(\alpha\lambda+\beta\mu+\gamma\nu)$. This represents a vector which
is a function of $\sigma$, viz., the function conjugate to $\Phi.\sigma$; and $\sigma.\Phi.\rho$
may be regarded as the product of this vector and $\rho$. This is not
so clearly indicated in the quaternionic notation, where it would be
straining things a little to call $S\sigma\phi$ a vector.
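
The symmetry just described is easily exhibited on numbers. The sketch below is only an illustration: the dyadic $\Phi=\alpha\lambda+\beta\mu+\gamma\nu$ is stored as a list of pairs of vectors, and the sample vectors and function names are chosen arbitrarily for the purpose.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def scale(k, v):
    return tuple(k * x for x in v)

def add(*vectors):
    return tuple(sum(components) for components in zip(*vectors))

# Phi = alpha lambda + beta mu + gamma nu, stored as (first, second) vector pairs.
alpha, beta, gamma = (1, 0, 2), (0, 1, 1), (1, 1, 0)
lam, mu, nu = (2, 0, 1), (1, 1, 0), (0, 3, 1)
Phi = [(alpha, lam), (beta, mu), (gamma, nu)]

def dyadic_dot_vector(dyads, rho):
    """Phi.rho = alpha (lambda.rho) + beta (mu.rho) + gamma (nu.rho)."""
    return add(*(scale(dot(second, rho), first) for first, second in dyads))

def vector_dot_dyadic(sigma, dyads):
    """sigma.Phi = (sigma.alpha) lambda + (sigma.beta) mu + (sigma.gamma) nu."""
    return add(*(scale(dot(sigma, first), second) for first, second in dyads))

def conjugate(dyads):
    """Interchange the two vectors of every term."""
    return [(second, first) for first, second in dyads]

rho = (1, -2, 3)
sigma = (4, 0, -1)

left = dot(sigma, dyadic_dot_vector(Phi, rho))    # sigma.(Phi.rho)
right = dot(vector_dot_dyadic(sigma, Phi), rho)   # (sigma.Phi).rho
print(left, right)
```

Here `vector_dot_dyadic(sigma, Phi)` is itself a vector, and it coincides with the conjugate function applied to `sigma`, that is, with `dyadic_dot_vector(conjugate(Phi), sigma)`.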

The combinations $\alpha\lambda$, $\beta\mu$, etc., used above, are distributive with
regard to each of the two vectors, and may be regarded as a kind of
product. If we wish to express everything in terms of $i$, $j$, and $k$, $\Phi$
will appear as a sum of $ii$, $ij$, $ik$, $ji$, $jj$, $jk$, $ki$, $kj$, $kk$, each with a
numerical coefficient. These nine coefficients may be arranged in a
square, and constitute a matrix; and the study of the properties of
expressions like $\Phi$ is identical with the study of ternary matrices.
This expression of the matrix as a sum of products (which may be
extended to matrices of any order) affords a point of departure from
which the properties of matrices may be deduced with the utmost
facility. The ordinary matricular product is expressed by a dot, as
$\Phi.\Psi$. Other important kinds of multiplication may be defined by
the equations

$$(\alpha\lambda){}^{\times}_{\times}(\beta\mu)=(\alpha\times\beta)(\lambda\times\mu),\qquad(\alpha\lambda):(\beta\mu)=(\alpha.\beta)(\lambda.\mu).$$

With these definitions $\tfrac{1}{6}\Phi{}^{\times}_{\times}\Phi:\Phi$ will be the determinant of $\Phi$, and
$\Phi{}^{\times}_{\times}\Phi$ will be the conjugate of the reciprocal of $\Phi$ multiplied by twice
the determinant. If $\Phi$ represents the manner in which vectors are
affected by a strain, $\tfrac{1}{2}\Phi{}^{\times}_{\times}\Phi$ will represent the manner in which
surfaces are affected, and $\tfrac{1}{6}\Phi{}^{\times}_{\times}\Phi:\Phi$ the manner in which volumes
are affected. Considerations of this kind do not attach themselves so
naturally to the notation $\phi=\alpha S\lambda+\beta S\mu+\gamma S\nu$, nor does the subject
admit so free a development with this notation, principally because 
the symbol S refers to a special use of the matrix, and is very much 
in the way when we want to apply the matrix to other uses, or to 
subject it to various operations. 
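
The statements about the determinant and the reciprocal may be verified on a numerical instance. The sketch below assumes the two multiplications are extended distributively over the terms of each dyadic, and stores a dyadic as a list of pairs of vectors; the sample vectors and all function names are illustrative only.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def double_cross(P, Q):
    """Distribute (a l) x x (b m) = (a x b)(l x m) over all pairs of terms."""
    return [(cross(a, b), cross(l, m)) for a, l in P for b, m in Q]

def double_dot(P, Q):
    """Distribute (a l):(b m) = (a.b)(l.m) over all pairs of terms."""
    return sum(dot(a, b) * dot(l, m) for a, l in P for b, m in Q)

def matrix(dyads):
    """Nine coefficients of the dyadic, arranged in a square."""
    return [[sum(a[i] * l[j] for a, l in dyads) for j in range(3)] for i in range(3)]

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

Phi = [((1, 0, 2), (2, 0, 1)), ((0, 1, 1), (1, 1, 0)), ((1, 1, 0), (0, 3, 1))]

M = matrix(Phi)
determinant = det3(M)
sixfold = double_dot(double_cross(Phi, Phi), Phi)   # Phi x x Phi : Phi
C = matrix(double_cross(Phi, Phi))                  # Phi x x Phi as a matrix

# If C is twice the determinant times the conjugate (transpose) of the
# reciprocal, then the transpose of C multiplied into M is 2*det times unity.
check = [[sum(C[k][i] * M[k][j] for k in range(3)) for j in range(3)] for i in range(3)]
print(determinant, sixfold, check)
```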




VIII. 


QUATERNIONS AND THE AUSDEHNUNGSLEHRE. 
[Nature, vol. XLIV. pp. 79-82, May 28, 1891.]


THE year 1844 is memorable in the annals of mathematics on account 
of the first appearance on the printed page of Hamilton’s Quaternions 
and Grassmann’s Ausdehnungslehre. The former appeared in the 
July, October, and supplementary numbers of the Philosophical Magazine,
after a previous communication to the Royal Irish Academy,
November 13, 1843. This communication was indeed announced to 
the Council of the Academy four weeks earlier, on the very day of 
Hamilton’s discovery of quaternions, as we learn from one of his 
letters. The author of the Ausdehnungslehre, although not uncon- 
scious of the value of his ideas, seems to have been in no haste to 
place himself on record, and published nothing until he was able 
to give the world the most characteristic and fundamental part of 
his system with considerable development in a treatise of more than 
300 pages, which appeared in August 1844.

The doctrine of quaternions has won a conspicuous place among the 
various branches of mathematics, but the nature and scope of the 
Ausdehnungslehre, and its relation to quaternions, seem to be still 
the subject of serious misapprehension in quarters where we naturally 
look for accurate information. Historical justice, and the interests of 
mathematical science, seem to require that the allusions to the Ausdehn- 
ungslehre in the article on “Quaternions” in the last edition of the 
Encyclopedia Britannica, and in the third edition of Prof. Tait’s 
Treatise on Quaternions, should not be allowed to pass without protest. 

It is principally as systems of geometrical algebra that quaternions 
and the Ausdehnungslehre come into comparison. To appreciate the
relations of the two systems, I do not see how we can proceed better 
than if we ask first what they have in common, then what either 
system possesses which is peculiar to itself. The relative extent and 
importance of the three fields, that which is common to the two 
systems, and those which are peculiar to each, will determine the 
relative rank of the geometrical algebras. Questions of priority can 
only relate to the field common to both, and will be much simplified 
by having the limits of that field clearly drawn. 

Geometrical addition in three dimensions is common to the two
systems, and seems to have been discovered independently both by 
Hamilton and Grassmann, as well as by several other persons about 
the same time. It is not probable that any especial claim for priority 
with respect to this principle will be urged for either of the two with 
which we are now concerned. 

The functions of two vectors which are represented in quaternions 
by $S\alpha\beta$ and $V\alpha\beta$ are common to both systems as published in 1844,
but the quaternion is peculiar to Hamilton’s. The linear vector 
function is common to both systems as ultimately developed, although 
mentioned only by Grassmann as early as 1844. 

To those already acquainted with quaternions, the first question will 
naturally be: To what extent are the geometrical methods which are 
usually called quaternionic peculiar to Hamilton, and to what extent 
are they common to Grassmann? This is a question which anyone 
can easily decide for himself. It is only necessary to run one’s eye 
over the equations used by quaternionic writers in the discussion of 
geometrical or physical subjects, and see how far they necessarily 
involve the idea of the quaternion, and how far they would be
intelligible to one understanding the functions $S\alpha\beta$ and $V\alpha\beta$, but having
no conception of the quaternion $\alpha\beta$, or at least could be made so by
trifling changes of notation, as by writing S or V in places where 
they would not affect the value of the expressions. For such a test 
the examples and illustrations in treatises on quaternions would be 
manifestly inappropriate, so far as they are chosen to illustrate 
quaternionic principles, since the object may influence the form of 
presentation. But we may use any discussion of geometrical or 
physical subjects, where the writer is free to choose the form most 
suitable to the subject. I myself have used the chapters and sections 
in Prof. Tait’s Quaternions on the following subjects: Geometry of 
the straight line and plane, the sphere and cyclic cone, surfaces of 
the second degree, geometry of curves and surfaces, kinematics, statics 
and kinetics of a rigid system, special kinetic problems, geometrical 
and physical optics, electrodynamics, general expressions for the action 
between linear elements, application of $\nabla$ to certain physical analogies,
pp. 160-371, except the examples (not worked out) at the close of the 
chapters. 

Such an examination will show that for the most part the methods 
of representing spatial relations used by quaternionic writers are 
common to the systems of Hamilton and Grassmann. To an extent 
comparatively limited, cases will be found in which the quaternionic 
idea forms an essential element in the signification of the equations. 
The question will then arise with respect to the comparatively 
limited field which is the peculiar property of Hamilton, How important
are the advantages to be gained by the use of the quaternion?
This question, unlike the preceding, is one into which a personal 
equation will necessarily enter. Everyone will naturally prefer the 
methods with which he is most familiar; but I think that it may be 
safely affirmed that in the majority of cases in this field the advantage 
derived from the use of the quaternion is either doubtful or very 
trifling. There remains a residuum of cases in which a substantial 
advantage is gained by the use of the quaternionic method. Such 
cases, however, so far as my own observation and experience extend, 
are very exceptional. If a more extended and careful inquiry should 
show that they are ten times as numerous as I have found them, they 
would still be exceptional. 

We have now to inquire what we find in the Ausdehnungslehre in
the way of a geometrical algebra, that is wanting in quaternions. In
addition to an algebra of vectors, the Ausdehnungslehre affords a
system of geometrical algebra in which the point is the fundamental 
element, and which for convenience I shall call Grassmann’s algebra 
of points. In this algebra we have first the addition of points, or 
quantities located at points, which may be explained as follows. The 


equation            aA + bB + cC + etc. = eE + fF + etc.,


in which the capitals denote points, and the small letters scalars (or
ordinary algebraic quantities), signifies that
                    a + b + c + etc. = e + f + etc.,

and also that the centre of gravity of the weights a, b, c, etc., at the
points A, B, C, etc., is the same as that of the weights e, f, etc., at
the points E, F, etc. (It will be understood that negative weights are
allowed as well as positive.) The equation is thus equivalent to four
equations of ordinary algebra. In this Grassmann was anticipated by
Möbius (Barycentrischer Calcul, 1827).
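
The four ordinary equations may be written out explicitly. In the sketch below, which is an illustration with hypothetical names, a sum of weighted points is reduced to four numbers: the total weight and the three sums of weight multiplied by coordinate. Two sums of weighted points are equal when these four numbers agree, and when the total weight is not zero the centre of gravity is the last three divided by the first.

```python
from fractions import Fraction

def four_numbers(terms):
    """Reduce a list of (weight, point) pairs to (total weight, sum w*x, sum w*y, sum w*z)."""
    total = sum(w for w, _ in terms)
    moments = tuple(sum(w * p[k] for w, p in terms) for k in range(3))
    return (total, *moments)

def equation_holds(left, right):
    """True when the two members are equal as sums of weighted points."""
    return four_numbers(left) == four_numbers(right)

def centre_of_gravity(terms):
    """Centre of gravity of the weights; undefined when the total weight is zero."""
    total, *moments = four_numbers(terms)
    return tuple(Fraction(m, total) for m in moments)

A, B, C = (0, 0, 0), (2, 0, 0), (0, 4, 0)
M = (1, 0, 0)    # middle point of AB
E = (1, -2, 0)

print(equation_holds([(1, A), (1, B)], [(2, M)]))             # A + B = 2M
print(equation_holds([(2, A), (1, B), (-1, C)], [(2, E)]))    # negative weight allowed
print(centre_of_gravity([(2, A), (1, B), (-1, C)]))
```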

We have next the addition of finite straight lines, or quantities 
located in straight lines (Liniengrössen). The meaning of the


equation            AB + CD + etc. = EF + GH + etc.


will perhaps be understood most readily, if we suppose that each 
member represents a system of forces acting on a rigid body. The 
equation then signifies that the two systems are equivalent. An 
equation of this form is therefore equivalent to six ordinary equations. 
It will be observed that the Liniengrössen AB and CD are not simply
vectors; they have not merely length and direction, but they are also
located each in a given line, although their position within those
lines is immaterial. In Clifford’s terminology, AB is a rotor, AB+CD
a motor. In the language of Prof. Ball’s Theory of Screws, AB+CD
represents either a twist or a wrench.

We have next the addition of plane surfaces (Plangrössen). The
equation            ABC + DEF + GHI = JKL


signifies that the plane JKL passes through the point common to the 
planes ABC, DEF, and GHI, and that the projection by parallel lines 
of the triangle JKL on any plane is equal to the sum of the pro- 
jections of ABC, DEF, and GHI on the same plane, the areas being 
taken positively or negatively according to the cyclic order of the 
projected points. This makes the equation equivalent to four ordinary 
equations. 
Finally, we have the addition of volumes, as in the equation 
ABCD+EFGH =IJKL, 


where there is nothing peculiar, except that each term represents the 
six-fold volume of the tetrahedron, and is to be taken positively or 
negatively according to the relative position of the points. 

We have also multiplications as follows: The line (Liniengrösse)
AB is regarded as the product of the points A and B. The Plangrösse
ABC, which represents the double area of the triangle, is regarded as
the product of the three points A, B, and C, or as the product of the
line AB and the point C, or of BC and A, or indeed of BA and C.
The volume ABCD, which represents six times the tetrahedron, is
regarded as the product of the points A, B, C, and D, or as the
product of the point A and the Plangrösse BCD, or as the product of
the lines AB and CD, etc., etc.

This does not exhaust the wealth of multiplicative relations which 
Grassmann has found in the very elements of geometry. The 
following products are called regressive, as distinguished from the 
progressive, which have been described. The product of the Plangrössen
ABC and DEF is a part of the line in which the planes ABC
and DEF intersect, which is equal in numerical value to the product
of the double areas of the triangles ABC and DEF multiplied by the
sine of the angle made by the planes. The product of the Liniengrösse
AB and the Plangrösse CDE is the point of intersection of the
line and the plane with a numerical coefficient representing the
product of the length of the line and the double area of the triangle
multiplied by the sine of the angle made by the line and the plane.
The product of three Plangrössen is consequently the point common
to the three planes with a certain numerical coefficient. In plane
geometry we have a regressive product of two Liniengrössen, which
gives the point of intersection of the lines with a certain numerical 
coefficient. 

The fundamental operations relating to the point, line, and plane 
are thus translated into analysis by multiplications. The immense 
flexibility and power of such an analysis will be appreciated by 
anyone who considers what generalized multiplication in connection 
with additive relations has done in other fields, as in quaternions, or 
in the theory of matrices, or in the algebra of logic. For a single 
example, if we multiply the equation 


AB + CD + etc. = EF + GH + etc. 
by PQ (P and Q being any two points), we have 


ABPQ+CDPQ+ etc. = EFPQ+GHPQ + etc., 
which will be recognised as expressing an important theorem of 
statics. 
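
The statical reading can be checked numerically in a small sketch. Here it is assumed (for illustration only) that AB, CD, and EF are forces applied at a common origin with EF their resultant, so that AB + CD = EF, and that P and Q are two arbitrary points fixing an axis. The product ABPQ then equals the length of PQ multiplied by the moment of the force AB about the axis PQ, so the multiplied equation states that the moments about any axis balance. All names below are hypothetical.

```python
# Sketch only: forces are assumed to be bound to the segment from A to B.

def sub(p, q):
    return tuple(a - b for a, b in zip(p, q))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def product4(a, b, p, q):
    """Six-fold volume ABPQ, the product of the lines AB and PQ."""
    return dot(sub(b, a), cross(sub(p, a), sub(q, a)))

def axis_moment(a, b, p, q):
    """|PQ| times the moment about the axis PQ of the force B-A applied at A."""
    return dot(sub(q, p), cross(sub(a, p), sub(b, a)))

O = (0, 0, 0)
AB = (O, (1, 0, 0))
CD = (O, (0, 1, 0))
EF = (O, (1, 1, 0))          # resultant of AB and CD at the common point O
P, Q = (2, 3, 1), (0, 1, 5)  # arbitrary axis

lhs = product4(*AB, P, Q) + product4(*CD, P, Q)
rhs = product4(*EF, P, Q)
moment_ab = axis_moment(*AB, P, Q)
```

With these sample values both sides of the multiplied equation agree, and the product ABPQ coincides with the scaled moment of AB about PQ.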

The field in which Grassmann’s algebra of points, as distinguished 
from his algebra of vectors, finds its especial application and utility 
is nearly coincident with that in which, when we use the methods 
of ordinary algebra, tetrahedral or anharmonic coordinates are more 
appropriate than rectilinear. In fact, Grassmann’s algebra of points 
may be regarded as the application of the methods of multiple algebra 
to the notions connected with tetrahedral coordinates, just as his or 
Hamilton’s algebra of vectors may be regarded as the application of 
the methods of multiple algebra to the notions connected with recti- 
linear coordinates. These methods, however, enrich the field to 
which they are applied with new notions. Thus the notion of the 
coordinates of a line in space, subsequently introduced by Plücker, 
was first given in the Ausdehnungslehre of 1844. It should also be 
observed that the utility of a multiple algebra when it takes the place 
of an ordinary algebra of four coordinates, is very much greater 
than when it takes the place of three coordinates, for the same reason 
that a multiple algebra taking the place of three coordinates is very 
much more useful than one taking the place of two. Grassmann’s 
algebra of points will always command the admiration of geometers 
and analysts, and furnishes an instrument of marvellous power to the 
former, and in its general form, as applicable to space of any number 
of dimensions, to the latter. To the physicist an algebra of points is 
by no means so indispensable an instrument as an algebra of vectors. 

Grassmann’s algebra of vectors, which we have described as co- 
incident with a part of Hamilton’s system, is not really anything 
separate from his algebra of points, but constitutes a part of it, the 
vector arising when one point is subtracted from another. Yet it 
constitutes a whole, complete in itself, and we may separate it from 
the larger system to facilitate comparison with the methods of 
Hamilton. 

We have, then, as geometrical algebras published in 1844, an 
algebra of vectors common to Hamilton and Grassmann, augmented 
on Hamilton’s side by the quaternion, and on Grassmann’s by 
his algebra of points. This statement should be made with the 
reservation that the addition both of vectors and of points had 
been given by earlier writers. 

In both systems as finally developed we have the linear vector 
function, the theory of which is identical with that of strains and 
rotations. In Hamilton’s system we have also the linear quaternion 
function, and in Grassmann’s the linear function applied to the 
quantities of his algebra of points. This application gives those 
transformations in which projective properties are preserved, the 
doctrine of reciprocal figures or principle of duality, etc. (Grassmann’s 
theory of the linear function is, indeed, broader than this, 
being coextensive with the theory of matrices; but we are here 
considering only the geometrical side of the theory.) 

In his earliest writings on quaternions, Hamilton does not discuss 
the linear function. In his Lectures on Quaternions (1853), he treats 
of the inversion of the linear vector function, as also of the linear 
quaternion function, and shows how to find the latent roots of the 
vector function, with the corresponding axes for the case of real and 
unequal roots. He also gives a remarkable equation, the symbolic 
cubic, which the functional symbol must satisfy. This equation is 
a particular case of that which is given in Prof. Cayley’s classical 
Memoir on the Theory of Matrices (1858), and which is called by 
Prof. Sylvester the Hamilton-Cayley equation. In his Elements of 
Quaternions (1866), Hamilton extends the symbolic equation to the 
quaternion function. 
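
The symbolic cubic can be illustrated for a linear vector function written as a 3 x 3 matrix. The sketch below assumes the familiar form φ³ − m″φ² + m′φ − m = 0, where m″ is the trace, m′ the sum of the principal two-rowed minors, and m the determinant; the symbols m, m′, m″, the helper names, and the sample matrix are assumptions chosen for illustration.

```python
# Sketch only: verifies the symbolic cubic for one integer 3 x 3 matrix.

IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def trace(a):
    return a[0][0] + a[1][1] + a[2][2]

def principal_minors(a):
    """Sum of the three principal two-rowed minors."""
    return (a[0][0] * a[1][1] - a[0][1] * a[1][0]
            + a[0][0] * a[2][2] - a[0][2] * a[2][0]
            + a[1][1] * a[2][2] - a[1][2] * a[2][1])

def det(a):
    return (a[0][0] * (a[1][1] * a[2][2] - a[1][2] * a[2][1])
            - a[0][1] * (a[1][0] * a[2][2] - a[1][2] * a[2][0])
            + a[0][2] * (a[1][0] * a[2][1] - a[1][1] * a[2][0]))

def symbolic_cubic(phi):
    """Return phi^3 - m'' phi^2 + m' phi - m I, which should vanish."""
    p2 = matmul(phi, phi)
    p3 = matmul(p2, phi)
    m2, m1, m0 = trace(phi), principal_minors(phi), det(phi)
    return [[p3[i][j] - m2 * p2[i][j] + m1 * phi[i][j] - m0 * IDENTITY[i][j]
             for j in range(3)] for i in range(3)]

PHI = [[2, 1, 0], [0, 3, 4], [5, 0, 1]]
residual = symbolic_cubic(PHI)
```

The residual is the zero matrix, which is the sense in which the functional symbol itself satisfies the cubic.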

In Grassmann, although the linear function is mentioned in the 
first Ausdehnungslehre, we do not find so full a discussion of the 
subject until the second Ausdehnungslehre (1862), where he discusses 
the latent roots and axes, or what corresponds to axes in the general 
theory, the whole discussion relating to matrices of any order. The 
more difficult cases are included, as that of a strain in which all 
the roots are real, but there is only one axis or unchanged direction. 
On the formal side he shows how a linear function may be repre- 
sented by a quotient or sum of quotients, and by a sum of products, 
Lückenausdruck. 

More important, perhaps, than the question when this or that 
theorem was first published is the question where we first find those 
notions and notations which give the key to the algebra of linear 
functions, or the algebra of matrices, as it is now generally called. 
In vol. xxxi, p. 35, of Nature, Prof. Sylvester speaks of Cayley’s 
“ever-memorable” Memoir on Matrices as constituting “a second 
birth of Algebra, its avatar in a new and glorified form,” and refers 
to a passage in his Lectures on Universal Algebra, from which, I 
think, we are justified in inferring that this characterization of the 
memoir is largely due to the fact that it is there shown how matrices 
may be treated as extensive quantities, capable of addition as well as 
of multiplication. This idea, however, is older than the memoir of 
1858. The Lückenausdruck, by which the matrix is expressed as a 
sum of a kind of products (lückenhaltig, or open), is described in a note 
at the end of the first Ausdehnungslehre. There we have the matrix 
given not only as a sum, but as a sum of products, introducing a 
multiplicative relation entirely different from the ordinary multipli- 
cation of matrices, and hardly less fruitful, but not lying nearly so 
near the surface as the relations to which Prof. Sylvester refers. The 
key to the theory of matrices is certainly given in the first Ausdehn- 
ungslehre, and if we call the birth of matricular analysis the second 
birth of algebra, we can give no later date to this event than the 
memorable year of 1844. 

The immediate occasion of this communication is the following 
passage in the preface to the third edition of Prof. Tait’s Quaternions : 

“Hamilton not only published his theory complete, the year before 
the first (and extremely imperfect) sketch of the Ausdehnungslehre 
appeared ; but had given ten years before, in his protracted study of 
Sets, the very processes of external and internal multiplication (corre- 
sponding to the Vector and Scalar parts of a product of two vectors) 
which have been put forward as specially the property of Grassmann.” 

For additional information we are referred to art. “Quaternions,” 
Encyc. Brit., where we read respecting the first Ausdehnungslehre : 

“In particular two species of multiplication (‘inner’ and ‘outer’) of 
directed lines in one plane were given. The results of these two 
kinds of multiplication correspond respectively to the numerical and 
the directed parts of Hamilton’s quaternion product. But Grassmann 
distinctly states in his preface that he had not had leisure to extend 
his method to angles in space.... But his claims, however great 
they may be, can in no way conflict with those of Hamilton, whose 
mode of multiplying couples (in which the ‘inner’ and ‘outer’ multiplication 
are essentially involved) was produced in 1833, and whose 
quaternion system was completed and published before Grassmann had 
elaborated for press even the rudimentary portions of his own system, 
in which the veritable difficulty of the whole subject, the application 
to angles in space, had not even been attacked.” 

I shall leave the reader to judge of the accuracy of the general 
terms used in these passages in comparing the first Ausdehnungslehre 
with Hamilton’s system as published in 1843 or 1844. The specific 
statements respecting Hamilton and Grassmann require an answer. 

It must be Hamilton’s Theory of Conjugate Functions or Algebraic 
Couples (read to the Royal Irish Academy, 1833 and 1835, and 
published in vol. xvii of the Transactions) to which reference is made 
in the statements concerning his “protracted study of Sets” and 
“mode of multiplying couples.” But I cannot find anything like 
Grassmann’s external or internal multiplication in this memoir, which 
is concerned, as the title pretty clearly indicates, with the theory of 
the complex quantities of ordinary algebra. 

It is difficult to understand the statements respecting the Ausdehn- 
ungslehre, which seem to imply that Grassmann’s two kinds of 
multiplication were subject to some kind of limitation to a plane. 
The external product is not limited in the first Ausdehnungslehre even 
to three dimensions. The internal, which is a comparatively simple 
matter, is mentioned in the first Ausdehnungslehre only in the preface, 
where it is defined, and placed beside the external product as relating 
to directed lines. There is not the least suggestion of any difference 
in the products in respect to the generality of their application to 
vectors. 

The misunderstanding seems to have arisen from the following 
sentence in Grassmann’s preface: “ And in general, in the consideration 
of angles in space, difficulties present themselves, for the complete 
(allseitig) solution of which I have not yet had sufficient leisure.” 
It is not surprising that Grassmann should have required more time 
for the development of some parts of his system, when we consider 
that Hamilton, on his discovery of quaternions, estimated the time 
which he should wish to devote to them at ten or fifteen years (see 
his letter to Prof. Tait in the North British Review for September 
1866), and actually took several years to prepare for the press as 
many pages as Grassmann had printed in 1844. But any speculation 
as to the questions which Grassmann may have had principally in 
mind in the sentence quoted, and the particular nature of the diffi- 
culties which he found in them, however interesting from other points 
of view, seems a very precarious foundation for a comparison of the 
systems of Hamilton and Grassmann as published in the years 
1843-44. Such a comparison should be based on the positive evidence 
of doctrines and methods actually published. 

Such a comparison I have endeavoured to make, or rather to 
indicate the basis on which it may be made, so far as systems of 
geometrical algebra are concerned. As a contribution to analysis in 
general, I suppose that there is no question that Grassmann’s system 
is of indefinitely greater extension, having no limitation to any 
particular number of dimensions. 


IX. 


QUATERNIONS AND THE ALGEBRA OF VECTORS. 
[Nature, vol. XLVII. pp. 463, 464, Mar. 16, 1893.] 


IN a recent number of Nature [vol. xlvii, p. 151], Mr. McAulay 
puts certain questions to Mr. Heaviside and to me, relating to a 
subject of such importance as to justify an answer somewhat at 
length. I cannot of course speak for Mr. Heaviside, although I 
suppose that his views are not very different from mine on the most 
essential points, but even if he shall have already replied before this 
letter can appear, I shall be glad to add whatever of force may belong 
to independent testimony. 

Mr. McAulay asks: “ What is the first duty of the physical vector 
analyst quâ physical vector analyst?” The answer is not doubtful. 
It is to present the subject in such a form as to be most easily 
acquired, and most useful when acquired. 

In regard to the slow progress of such methods towards recognition 
and use by physicists and others, which Mr. McAulay deplores, it does 
not seem possible to impute it to any want of uniformity of notation. 
I doubt whether there is any modern branch of mathematics which 
has been presented for so long a time with a greater uniformity of 
notation than quaternions. 

What, then, is the cause of the fact which Mr. McAulay and all 
of us deplore? It is not far to seek. We need only a glance at the 
volumes in which Hamilton set forth his method. No wonder that 
physicists and others failed to perceive the possibilities of simplicity, 
perspicuity, and brevity which were contained in a system presented 
to them in ponderous volumes of 800 pages. Perhaps Hamilton may 
have intended these volumes as a sort of thesaurus, and we should 
look to his shorter papers for a compact account of his method. But 
if we turn to his earlier papers on Quaternions in the Philosophical 
Magazine, in which principally he introduced the subject to the 
notice of his contemporaries, we find them entitled “On Quaternions; 
or on a New System of Imaginaries in Algebra,” and in them we 
find a great deal about imaginaries, and very little of a vector 
analysis. To show how slowly the system of vector analysis 
developed itself in the quaternionic nidus, we need only say that 
the symbols S, V, and ∇ do not appear until two or three years 
after the discovery of quaternions. In short, it seems to have been 
only a secondary object with Hamilton to express the geometrical 
relations of vectors,—secondary in time, and also secondary in this, 
that it was never allowed to give shape to his work. 

But this relates to the past. In regard to the present status, I beg 
leave to quote what Mr. McAulay has said on another occasion (see 
Phil. Mag., June 1892): “Quaternions differ in an important respect 
from other branches of mathematics that are studied by mathe- 
maticians after they have in the course of years of hard labour laid 
the foundation of all their future work. In nearly all cases these 
branches are very properly so called. They each grow out of a 
definite spot of the main tree of mathematics, and derive their 
sustenance from the sap of the trunk as a whole. But not so with 
quaternions. To let these grow in the brain of a mathematician, 
he must start from the seed as with the rest of his mathematics 
regarded as a whole. He cannot graft them on his already flourishing 
tree, for they will die there. They are independent plants that 
require separate sowing and the consequent careful tending.” 

Can we wonder that mathematicians, physicists, astronomers, and 
geometers feel some doubt as to the value or necessity of something 
so separate from all other branches of learning? Can that be a 
natural treatment of the subject which has no relations to any other 
method, and, as one might suppose from reading some treatises, has 
only occurred to a single man? Or, at best, is it not discouraging 
to be told that in order to use the quaternionic method, one must 
give up the progress which he has already made in the pursuit of 
his favourite science, and go back to the beginning and start anew 
on a parallel course? 

I believe, however, that if what I have quoted is true of vector 
methods, it is because there is something fundamentally wrong in 
the presentation of the subject. Of course, in some sense and to some 
extent it is and must be true. Whatever is special, accidental, and 
individual, will die, as it should; but that which is universal and 
essential should remain as an organic part of the whole intellectual 
acquisition. If that which is essential dies with the accidental, it 
must be because the accidental has been given the prominence which 
belongs to the essential. For myself, I should preach no such doctrine 
to those whom I wish to convert to the true faith. 

In Italy, they say, all roads lead to Rome. In mechanics, kine- 
matics, astronomy, physics, all study leads to the consideration of 
certain relations and operations. These are the capital notions; these 
should have the leading parts in any analysis suited to the subject. 

If I wished to attract the student of any of these sciences to an 
algebra for vectors, I should tell him that the fundamental notions 
of this algebra were exactly those with which he was daily conversant. 
I should tell him that a vector algebra is so far from being 
any one man’s production that half a century ago several were already 
working toward an algebra which should be primarily geometrical and 
not arithmetical, and that there is a remarkable similarity in the results 
to which these efforts led (see Proc. A.A.A.S. for 1886, pp. 37, ff.) [this 
vol. p. 91, ff]. I should call his attention to the fact that Lagrange 
and Gauss used the notation (αβγ) to denote precisely the same as 
Hamilton by his S(αβγ), except that Lagrange limited the expression 
to unit vectors, and Gauss to vectors of which the length is the secant 
of the latitude, and I should show him that we have only to give up 
these limitations, and the expression (in connection with the notion 
of geometrical addition) is endowed with an immense wealth of 
transformations. I should call his attention to the fact that the 
notation [r₁r₂], universal in the theory of orbits, is identical with 
Hamilton’s V(ρ₁ρ₂), except that Hamilton takes the area as a vector, 
i.e., includes the notion of the direction of the normal to the plane of 
the triangle, and that with this simple modification (and with the 
notion of geometrical addition of surfaces as well as of lines) this 
expression becomes closely connected with the first-mentioned, and is 
not only endowed with a similar capability for transformation, but 
enriches the first with new capabilities. In fact, I should tell him 
that the notions which we use in vector analysis are those which he 
who reads between the lines will meet on every page of the great 
masters of analysis, or of those who have probed deepest the secrets of 
nature, the only difference being that the vector analyst, having regard 
to the weakness of the human intellect, does as the early painters 
who wrote beneath their pictures “ This is a tree,” “ This is a horse.” 
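
A small sketch may make the "wealth of transformations" concrete. It assumes that the first expression is the triple product of three vectors (a determinant) and that the second is the double area of a triangle taken as a vector (a cross product); once neither is restricted to vectors of special length, the two are linked by identities such as (α×β).(γ×δ) = (α.γ)(β.δ) − (α.δ)(β.γ). The function names and sample vectors are illustrative assumptions.

```python
# Sketch only: integer vectors are used so that every check is exact.

def cross(u, v):
    """Double area of the triangle on u and v, taken as a vector."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def triple(a, b, c):
    """Triple product a . (b x c), equal to the determinant of the three vectors."""
    return dot(a, cross(b, c))

a, b, c, d = (1, 2, 3), (0, 1, 4), (5, 6, 0), (2, 0, 1)

t = triple(a, b, c)
# Cyclic interchange leaves the triple product unchanged.
t_cyclic = triple(b, c, a)
# An identity connecting the area vector with the direct product.
lhs = dot(cross(a, b), cross(c, d))
rhs = dot(a, c) * dot(b, d) - dot(a, d) * dot(b, c)
```

Neither check depends on the vectors being of unit length, which is the generalization the text describes.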

I cannot attach quite so much importance as Mr. McAulay to 
uniformity of notation. That very uniformity, if it existed among 
those who use a vector analysis, would rather obscure than reveal 
their connection with the general course of modern thought in 
mathematics and physics. There are two ways in which we may 
measure the progress of any reform. The one consists in counting 
those who have adopted the shibboleth of the reformers; the other 
measure is the degree in which the community is imbued with the 
essential principles of the reform. I should apply the broader 
measure to the present case, and do not find it quite so bad as 
Mr. McAulay does. 

Yet the question of notations, although not the vital question, is 
certainly important, and I assure Mr. McAulay that reluctance to 
make unnecessary innovations in notation has been a very powerful 
motive in restraining me from publication. Indeed my pamphlet on 
Vector Analysis, which has excited the animadversion of quaternionists, 
was never formally published, although rather widely 
distributed, so long as I had copies to distribute, among those who 
I thought might be interested in the subject. I may say, however, 
since I am called upon to defend my position, that I have found 
the notations of that pamphlet more flexible than those generally 
used. Mr. McAulay, at least, will understand what I mean by this, 
if I say that some of the relations which he has thought of sufficient 
importance to express by means of special devices (see Proc. R.S.E. 
for 1890-91), may be expressed at least as briefly in the notations 
which I have used, and without special devices. But I should not 
have been satisfied for the purposes of my pamphlet with any 
notation which should suggest even to the careless reader any con- 
nection with the notion of the quaternion. For I confess that one 
of my objects was to show that a system of vector analysis does not 
require any support from the notion of the quaternion, or, I may 
add, of the imaginary in algebra. 

I should hardly dare to express myself with so much freedom, if 
I could not shelter myself behind an authority which will not be 
questioned. 

I do not see that I have done anything very different from what 
the eminent mathematician upon whom Hamilton’s mantle has fallen 
has been doing, it would seem, unconsciously. Contrast the system of 
quaternions, which he has described in his sketch of Hamilton’s life 
and work in the North British Review for September, 1866, with the 
system which he urges upon the attention of physicists in the Philo- 
sophical Magazine in 1890. In 1866 we have a great deal about 
imaginaries, and nearly as much about the quaternion. In 1890 we 
have nothing about imaginaries, and little about the quaternion. 
Prof. Tait has spoken of the calculus of quaternions as throwing off 
in the course of years its early Cartesian trammels. I wonder that 
he does not see how well the progress in which he has led may be 
described as throwing off the yoke of the quaternion. A characteristic 
example is seen in the use of the symbol ∇. Hamilton applies this to 
a vector to form a quaternion, Tait to form a linear vector function. 
But while breathing a new life into the formulæ of quaternions, 
Prof. Tait stands stoutly by the letter. 

Now I appreciate and admire the generous loyalty toward one 
whom he regards as his master, which has always led Prof. Tait to 
minimise the originality of his own work in regard to quaternions, 
and write as if everything was contained in the ideas which flashed 
into the mind of Hamilton at the classic Brougham Bridge. But not 
to speak of other claims of historical justice, we owe duties to our 
scholars as well as to our teachers, and the world is too large, and the 
current of modern thought is too broad, to be confined by the ipse 
dixit even of a Hamilton. 

X. 


QUATERNIONS AND VECTOR ANALYSIS. 
[Nature, vol. XLVII. pp. 364-367, Aug. 17, 1893.] 


IN a paper by Prof. C. G. Knott on “Recent Innovations in Vector 
Theory,” of which an abstract has been given in Nature (vol. xlvii, 
pp. 590-593; see also a minor abstract on p. 287), the doctrine that 
the quaternion affords the only sufficient and proper basis for vector 
analysis is maintained by arguments based so largely on the faults 
and deficiencies which the author has found in my pamphlet, Elements 
of Vector Analysis, as to give to such faults an importance which 
they would not otherwise possess, and to make some reply from me 
necessary, if I would not discredit the cause of non-quaternionic 
vector analysis. Especially is this true in view of the warm com- 
mendation and endorsement of the paper, by Prof. Tait, which 
appeared in Nature somewhat earlier (p. 225). 

The charge which most requires a reply is expressed most distinctly 
in the minor abstract, viz. “that in the development of his dyadic 
notation, Prof. Gibbs, being forced to bring the quaternion in, logically 
condemned his own position.” This was incomprehensible to me until 
I received the original paper, where I found the charge specified as 
follows: “Although Gibbs gets over a good deal of ground without 
the explicit recognition of the complete product, which is the difference 
of his ‘skew’ and ‘direct’ products, yet even he recognises in plain 
language the versorial character of a vector, brings in the quaternion 
whose vector is the difference of a linear vector function and its 
conjugate, and does not hesitate to use the accursed thing itself in 
certain line, surface, and volume integrals” (Proc. R.S.E., Session 
1892-3, p. 236). These three specifications I shall consider in their 
inverse order, premising, however, that the epitheta ornantia are 
entirely my critic’s. 

The last charge is due entirely to an inadvertence. The integrals 
referred to are those given at the close of the major abstract in 
Nature (p. 593). My critic, in his original paper, states quite correctly 
that, according to my definitions and notations, they should represent 
dyadics. He multiplies them into a vector, introducing the vector 
under the integral sign, as is perfectly proper, provided, of course, 
that the vector is constant. But failing to observe this restriction, 
evidently through inadvertence, and finding that the resulting 
equations (thus interpreted) would not be true, he concludes that I 
must have meant something else by the original equations. Now, 
these equations will hold if interpreted in the quaternionic sense, as is, 
indeed, a necessary consequence of their holding in the dyadic sense, 
although the converse would not be true. My critic was thus led, 
in consequence of the inadvertence mentioned, to suppose that I had 
departed from my ordinary usage and my express definitions, and 
had intended the products in these integrals to be taken in the 
quaternionic sense. This is the sole ground for the last charge. 

The second charge evidently relates to the notations Φ_S and Φ_× (see 
Nature, vol. xlvii, p. 592). It is perfectly true that I have used a 
scalar and a vector connected with the linear vector operator, which, 
if combined, would form a quaternion. I have not thus combined 
them. Perhaps Prof. Knott will say that since I use both of them it 
matters little whether I combine them or not. If so I heartily agree 
with him. 

The first charge is a little vague. I certainly admit that vectors 
may be used in connection with and to represent rotations. I have no 
objection to calling them in such cases versorial. In that sense 
Lagrange and Poinsot, for example, used versorial vectors. But what 
has this to do with quaternions? Certainly Lagrange and Poinsot 
were not quaternionists. 

The passage in the major abstract in Nature which most distinctly 
charges me with the use of the quaternion is that in which a certain 
expression which I use is said to represent the quaternion operator 
q( )q⁻¹ (vol. xlvii, p. 592). It would be more accurate to say that 
my expression and the quaternionic expression represent the same 
operator. Does it follow that I have used a quaternion? Not at all. 
A quaternionic expression may represent a number. Does everyone 
who uses any expression for that number use quaternions? A 
quaternionic expression may represent a vector. Does everyone who 
uses any expression for that vector use quaternions? A quaternionic 
expression may represent a linear vector operator. If I use an 
expression for that linear vector operator do I therefore use quater- 
nions? My critic is so anxious to prove that I use quaternions that 
he uses arguments which would prove that quaternions were in 
common use before Hamilton was born. 
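
The distinction between an operator and one particular expression for it can be shown in a short sketch. A quarter turn about the z axis is written below in two ways: as the quaternionic operator q( )q⁻¹ and as a linear vector operator given by a 3 x 3 array. Both send every vector to the same result, although only the first expression involves a quaternion. The quaternion q = 1 + k, the array PHI, and all function names are assumptions chosen for illustration.

```python
from fractions import Fraction

# Sketch only: quaternions are 4-tuples (w, x, y, z); vectors are 3-tuples.

def qmul(p, q):
    """Hamilton's product of two quaternions."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
            a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
            a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
            a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2)

def qinv(q):
    """Reciprocal of q: the conjugate divided by the squared norm."""
    n = sum(c * c for c in q)
    return (Fraction(q[0], n), Fraction(-q[1], n),
            Fraction(-q[2], n), Fraction(-q[3], n))

def rotate_quaternion(q, v):
    """Apply the operator q( )q^-1 to the vector v."""
    result = qmul(qmul(q, (0,) + tuple(v)), qinv(q))
    return result[1:]

def rotate_linear(phi, v):
    """Apply the linear vector operator phi (a 3 x 3 array) to v."""
    return tuple(sum(phi[i][k] * v[k] for k in range(3)) for i in range(3))

Q = (1, 0, 0, 1)                            # 1 + k; its scale cancels in q( )q^-1
PHI = ((0, -1, 0), (1, 0, 0), (0, 0, 1))    # quarter turn about the z axis

samples = [(1, 0, 0), (0, 1, 0), (3, -2, 5)]
by_quaternion = [rotate_quaternion(Q, v) for v in samples]
by_linear = [rotate_linear(PHI, v) for v in samples]
```

The two lists coincide, so the two expressions represent one and the same operator; the use of the second does not require the first.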

So much for the alleged use of the quaternion in my pamphlet. 
Let us now consider the faults and deficiencies which have been found 
therein and attributed to the want of the quaternion. The most 
serious criticism in this respect relates to certain integrating operators, 
which Prof. Tait unites with Prof. Knott in ridiculing. As definitions 
are wearisome, I will illustrate the use of the terms and notations 
which I have used by quoting a sentence addressed to the British 
Association a few years ago. The speaker was Lord Kelvin. 

“Helmholtz first solved the problem—Given the spin in any case 
of liquid motion, to find the motion. His solution consists in finding 
the potentials of three ideal distributions of gravitational matter 
having densities respectively equal to 1/2 of the rectangular com- 
ponents of the given spin; and, regarding for a moment these 
potentials as rectangular components of velocity in a case of liquid 
motion, taking the spin in this motion as the velocity in the required 
motion” (Nature, vol. xxxviii, p. 569). 

In the terms and notations of my pamphlet the problem and 
solution may be thus expressed : 

Given the curl in any case of liquid motion—to find the motion. 

The required velocity is 1/4π of the curl of the potential of the 
given curl. 

Or, more briefly: The required velocity is 1/4π of the Laplacian 
of the given curl. 

Or in purely analytical form: Required ω in terms of ∇×ω, when 
∇.ω=0. 

Solution: ω = (1/4π) ∇× Pot ∇×ω = (1/4π) Lap ∇×ω. 

(The Laplacian expresses the result of an operation like that by 
which magnetic force is calculated from electric currents distributed 
in space. This corresponds to the second form in which Helmholtz 
expressed his result.) 
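
The solution may be verified in a few lines. The sketch below is not taken from the pamphlet; it assumes that Pot u denotes the integral of u′/r over all space, that the fields vanish at infinity fast enough for Pot to be interchanged with ∇, that Lap stands for ∇× Pot, and that Poisson's equation holds in the form ∇.∇ Pot u = −4πu.

```latex
\begin{align*}
\nabla\times\operatorname{Pot}\,\nabla\times\omega
  &= \operatorname{Pot}\,\nabla\times\nabla\times\omega
     && \text{(assumed: Pot commutes with } \nabla\text{)} \\
  &= \operatorname{Pot}\,\bigl[\nabla(\nabla\cdot\omega)
     - \nabla\cdot\nabla\,\omega\bigr] \\
  &= -\operatorname{Pot}\,\nabla\cdot\nabla\,\omega
     && \text{(since } \nabla\cdot\omega = 0\text{)} \\
  &= -\nabla\cdot\nabla\,\operatorname{Pot}\,\omega
   = 4\pi\,\omega
     && \text{(Poisson's equation)} .
\end{align*}
```

Dividing by 4π gives the velocity in terms of its curl, in agreement with the solution stated above.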

To show the incredible rashness of my critics, I will remark that 
these equations are among those of which it is said in the original 
paper (Proc. R.S.E., Session 1892-93, p. 225), “Gibbs gives a good 
many equations—theorems I suppose they ape at being.” I may add 
that others of the equations thus characterized are associated with 
names not less distinguished than that of Helmholtz. But that to 
which I wish especially to call attention is that the terms and 
notations in question express exactly the notions which physicists 
want to use. 

But we are told (Nature, vol. xlvii, p. 287) that these integrating 
operators (Pot, Lap) are best expressed as inverse functions of ∇. 
To see how utterly inadequate the Nabla would have been to express 
the idea, we have only to imagine the exclamation points which the 
members of the British Association would have looked at each other 
if the distinguished speaker had said: 

Helmholtz first solved the problem—Given the Nabla of the velocity 
in any case of liquid motion, to find the velocity. His solution was 
that the velocity was the Nabla of the inverse square of Nabla of 
the Nabla of the velocity. Or, that the velocity was the inverse 
Nabla of the Nabla of the velocity. 
Or, if the problem and solution had been written thus: Required ω 
in terms of ∇ω when S∇ω=0. 

Solution: ω = ∇∇⁻²∇ω = ∇⁻¹∇ω. 

My critic has himself given more than one example of the unfitness 
of the inverse Nabla for the exact expression of thought. For 
example, when he says that I have taken “eight distinct steps to 
prove two equations: which are special cases of 

∇²∇⁻²u = u,” 

I do not quite know what he means. If he means that I have taken 
eight steps to prove Poisson’s Equation (which certainly is not ex- 
pressed by the equation cited, although it may perhaps be associated 
with it in some minds), I will only say that my proof is not very 
long, especially as I have aimed at greater rigor than is usually 
thought necessary. I cannot, however, compare my demonstration 
with that of quaternionic writers, as I have not been able (doubtless 
on account of insufficient search) to find any such. 

To show how little foundation there is for the charge that the 
deficiencies of my system require to be pieced out by these integral 
operators, I need only say that if I wished to economise operators I 
might give up New, Lap, and Max, writing for them ∇ Pot, ∇× Pot,
and ∇· Pot, and if I wished further to economise in what costs so little,
I could give up the potential also by using the notation (∇·∇)⁻¹ or
∇⁻². That is, I could have used this notation without greater sacrifice
of precision than quaternionic writers seem to be willing to make. I 
much prefer, however, to avoid these inverse operators as essentially 
indefinite. 

Nevertheless—although my critic has greatly obscured the subject 
by ridiculing operators, which I beg leave to maintain are not worthy 
of ridicule, and by thoughtlessly asserting that it was necessary for 
me to use them, whereas they are only necessary for me in the sense 
in which something of the kind is necessary for the quaternionist also, 
if he would use a notation irreproachable on the score of exactness— 
I desire to be perfectly candid. I do not wish to deny that the 
relations connected with these notations appear a little more simple 
in the quaternionic form. I had, indeed, this subject principally in 
mind when I said two years ago in Nature (vol. xliii, p. 512) [this vol. 
p. 158]: “There are a few formulæ in which there is a trifling gain
in compactness in the use of the quaternion.” Let us see exactly 
how much this advantage amounts to. 

There is nothing which the most rigid quaternionist need object to 
in the notation for the potential, or indeed for the Newtonian. These 
represent respectively the operations by which the potential or the 
force of gravitation is calculated from the density of matter. A 
quaternionist would, however, apply the operator New not only to a
scalar, as I have done, but to a vector also. The vector part of New ω
(construed in the quaternionic sense) would be exactly what I have
represented by Lap ω, and the scalar part, taken negatively, would be
exactly what I have represented by Max ω. The quaternionist has
here a slight economy in notations, which is of less importance, since 
all the operators—New, Lap, Max—may be expressed without 
ambiguity in terms of the potential, which is therefore the only one 
necessary for the exact expression of thought. 

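But what are the formulæ which it is necessary for one to remember
who uses my notations? Evidently only those which contain the
operator Pot. For all the others are derived from these by the simple
substitutions

$$\mathrm{New} = \nabla\,\mathrm{Pot}, \qquad \mathrm{Lap} = \nabla\times\mathrm{Pot}, \qquad \mathrm{Max} = \nabla\cdot\mathrm{Pot}.$$

Whether one is quaternionist or not, one must remember Poisson’s
Equation, which I write

$$\nabla\cdot\nabla\,\mathrm{Pot}\,\omega = -4\pi\omega,$$

and in quaternionic might be written

$$\nabla^{2}\,\mathrm{Pot}\,\omega = 4\pi\omega.$$

If ω is a vector, in using my equations one has also to remember the
general formula,

$$\nabla\cdot\nabla\omega = \nabla\nabla\cdot\omega - \nabla\times\nabla\times\omega,$$

which as applied to the present case may be united with the preceding
in the three-membered equation,

$$\nabla\cdot\nabla\,\mathrm{Pot}\,\omega = \nabla\nabla\cdot\mathrm{Pot}\,\omega - \nabla\times\nabla\times\mathrm{Pot}\,\omega = -4\pi\omega.$$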


This single equation is absolutely all that there is to burden the 
memory of the student, except that the symbols of differentiation 
(∇, ∇×, ∇·) may be placed indifferently before or after the symbol
for the potential, and that if we choose we may substitute as above
New for ∇ Pot, etc. Of course this gives a good many equations,
which on account of the importance of the subject (as they might 
almost be said to give the mathematics of the electro-magnetic field) 
I have written out more in detail than might seem necessary. I have 
also called the attention of the student to many things, which perhaps 
he might be left to himself to see. Prof. Knott says that the quater- 
nionist obtains similar equations by the simplest transformations. 
He has failed to observe that the same is true in my Vector Analysis, 
when once I have proved Poisson’s Equation. Perhaps he takes his 
model of brevity from Prof. Tait, who simplifies the subject, I believe, 
in his treatise on Quaternions, by taking this theorem for granted. 
Nevertheless, since I am forced so often to disagree with Prof. 
Knott, I am glad to agree with him when I can. He says in his 
original paper (p. 226), “No finer argument in favour of the real 
quaternion vector analysis can be found than in the tangle and the 
jangle of sections 91 to 104 in the Elements of Vector Analysis.” 
Now I am quite ready to plead guilty to the tangle. The sections
mentioned, as is sufficiently evident to the reader, were written at 
two different times, sections 102-104 being an addition after a couple 
of years. The matter of these latter sections is not found in its 
natural place, and the result is well enough characterised as a tangle. 
It certainly does credit to the conscientious study which Prof. Knott 
has given to my pamphlet, that he has discovered that there is a 
violent dislocation of ideas just at this point. For such a fault of 
composition I have no sufficient excuse to offer, but I must protest 
against its being made the ground of any broad conclusions in regard 
to the fundamental importance of the quaternion. 

Prof. Knott next proceeds to criticise—or at least to ridicule—my 
treatment of the linear vector function, with respect to which we read 
in the abstract :—“ As developed in the pamphlet, the theory of the 
dyadic goes over much the same ground as is traversed in the last 
chapter of Kelland and Tait’s Introduction to Quaternions. With 
the exception of a few of those lexicon products, for which Prof. 
Gibbs has such an affection, there is nothing of real value added to 
our knowledge of the linear vector function.” It would not, I think, 
be difficult to show some inaccuracy in my critic’s characterisation 
of the real content of this part of my pamphlet. But as algebra is 
a formal science, and as the whole discussion is concerning the best 
form of representing certain kinds of relations, the important question 
would seem to be whether there is anything of formal value in my 
treatment of the linear vector function. 

Now Prof. Knott distinctly characterises in half a dozen words the 
difference in the spirit and method of my treatment of this subject 
from that which is traditional among quaternionists, when he says 
of what I have called dyadics—“these are not quantities, but operators” 
(Nature, vol. xlvii, p. 592). I do not think that I applied the word
quantity to the dyadics, but Prof. Knott recognised that I treated 
them as quantities—not, of course, as the quantities of arithmetic, or 
of ordinary algebra, but as quantities in the broader sense, in which, 
for example, quaternions are called quantities. The fact that they 
may be operators does not prevent this. Just as in grammar verbs 
may be taken as substantives, viz., in the infinitive mood, so in algebra 
operators—especially such as are capable of quantitative variation— 
may be regarded as quantities when they are made the subject of
algebraic comparison or operation. Now I would not say that it is 
necessary to treat every kind of operator as quantity, but I certainly 
think that one so important as the linear vector operator, and one
which lends itself so well to such broader treatment, is worthy of it.
Of course, when vectors are treated by the methods of ordinary
algebra, linear vector operators will naturally be treated by the same
methods, but in an algebra formed for the sake of expressing the
relations between vectors, and in which vectors are treated as multiple 
quantities, it would seem an incongruity not to apply the methods 
of multiple algebra also to the linear vector operator. 

The dyadic is practically the linear vector operator regarded as 
quantity. More exactly it is the multiple quantity of the ninth order 
which affords various operators according to the way in which it is 
applied. I will not venture to say what ought to be included in a 
treatise on quaternions, in which, of course, a good many subjects 
would have claims prior to the linear vector operator; but for the 
purposes of my pamphlet, in which the linear vector operator is one 
of the most important topics, I cannot but regard a treatment like 
that in Hamilton’s Lectures, or Elements, as wholly inadequate on 
the formal side. To show what I mean, I have only to compare 
Hamilton’s treatment of the quaternion and of the linear vector 
operator with respect to notations. Since quaternions have been 
identified with matrices, while the linear vector operator evidently 
belongs to that class of multiple quantities, it seems unreasonable to 
refuse to the one those notations which we grant to the other. Thus, 
if the quaternionist has e^q, log q, sin q, cos q, why should not the vector
analyst have e^Φ, log Φ, sin Φ, cos Φ, where Φ represents a linear vector
operator? I suppose the latter are at least as useful to the physicist. 
I mention these notations first, because here the analogy is most 
evident. But there are other cases far more important, because more 
elementary, in which the analogy is not so near the surface, and 
therefore the difference in Hamilton’s treatment of the two kinds of 
multiple quantity not so evident. We have, for example, the tensor 
of the quaternion, which has the important property represented by 
the equation: T(qr)=TqTr. 
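
The tensor property just quoted is easy to confirm numerically. The following Python sketch is an editorial illustration and not part of the original letter; it assumes a quaternion stored as a tuple (w, x, y, z), and the helper names `qmul` and `tensor` as well as the sample values are hypothetical.

```python
import math

def qmul(q, r):
    """Hamilton product of two quaternions given as (w, x, y, z) tuples."""
    w1, x1, y1, z1 = q
    w2, x2, y2, z2 = r
    return (
        w1 * w2 - x1 * x2 - y1 * y2 - z1 * z2,
        w1 * x2 + x1 * w2 + y1 * z2 - z1 * y2,
        w1 * y2 - x1 * z2 + y1 * w2 + z1 * x2,
        w1 * z2 + x1 * y2 - y1 * x2 + z1 * w2,
    )

def tensor(q):
    """Hamilton's tensor Tq, i.e. the Euclidean length of the quaternion."""
    return math.sqrt(sum(c * c for c in q))

# Arbitrary sample quaternions (illustrative values only).
q = (1.0, 2.0, -1.0, 0.5)
r = (0.3, -0.7, 2.0, 1.5)

# The two printed numbers should agree up to rounding: T(qr) = Tq Tr.
print(tensor(qmul(q, r)), tensor(q) * tensor(r))
```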

There is a scalar quantity related to the linear vector operator, 
which I have represented by the notation |Φ| and called the
determinant of Φ. It is in fact the determinant of the matrix by
which Φ may be represented, just as the square of the tensor of q
(sometimes called the norm of q) is the determinant of the matrix by
which q may be represented. It may also be defined as the product
of the latent roots of Φ, just as the square of the tensor of q might be
defined as the product of the latent roots of q. Again, it has the
property represented by the equation

$$|\Phi\cdot\Psi| = |\Phi|\,|\Psi|,$$
which corresponds exactly with the preceding equation with both 
sides squared. 
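
As an editorial illustration of the determinant property, the sketch below represents two linear vector operators by 3×3 integer matrices (an assumption made for the example; the matrices and the helper names `matmul` and `det3` are hypothetical) and checks that the determinant of the product equals the product of the determinants.

```python
def matmul(a, b):
    """Product of two 3x3 matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def det3(m):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

# Two sample operators (illustrative integer matrices, so the check is exact).
phi = [[2, 1, 0], [0, 3, -1], [1, 0, 1]]
psi = [[1, 0, 2], [-1, 1, 0], [0, 4, 1]]

# |Phi.Psi| compared with |Phi| |Psi|.
print(det3(matmul(phi, psi)), det3(phi) * det3(psi))
```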

There is another scalar quantity connected with the quaternion and
represented by the notation Sq. It has the important property
expressed by the equation,

$$S(qrs) = S(rsq) = S(sqr),$$

and so for products of any number of quaternions, in which the cyclic
order remains unchanged. In the theory of the linear vector operator
there is an important quantity which I have represented by the
notation Φ_S, and which has the property represented by the equation

$$(\Phi\cdot\Psi\cdot\Omega)_S = (\Psi\cdot\Omega\cdot\Phi)_S = (\Omega\cdot\Phi\cdot\Psi)_S,$$

where the number of the factors is as before immaterial. Φ_S may be
defined as the sum of the latent roots of Φ, just as 2Sq may be
defined as the sum of the latent roots of q.

The analogy of these notations may be further illustrated by
comparing the equations

$$T e^{q} = e^{Sq}$$

and

$$|e^{\Phi}| = e^{\Phi_S}.$$

I do not see why it is not as reasonable for the vector analyst to
have notations like |Φ| and Φ_S, as for the quaternionist to have the
notations Tq and Sq. 
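
Both properties of the scalar of an operator can be checked with a few lines of Python. This is an editorial sketch under stated assumptions: the operator is represented by a 3×3 matrix, its scalar is taken to be the trace (the sum of the latent roots), and the exponential is approximated by a truncated power series. All names and sample matrices are hypothetical.

```python
import math

def matmul(a, b):
    """Product of two 3x3 matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def det3(m):
    """Determinant of a 3x3 matrix."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def trace(m):
    """Sum of the diagonal elements, playing the part of the scalar of the operator."""
    return m[0][0] + m[1][1] + m[2][2]

def scalar_of_product(*factors):
    """Trace of the product of the given matrices, taken in order."""
    product = factors[0]
    for factor in factors[1:]:
        product = matmul(product, factor)
    return trace(product)

def expm(m, terms=40):
    """Matrix exponential by truncated power series (adequate for small entries)."""
    result = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]
    term = [row[:] for row in result]  # holds m**n / n!
    for n in range(1, terms):
        term = [[x / n for x in row] for row in matmul(term, m)]
        result = [[result[i][j] + term[i][j] for j in range(3)] for i in range(3)]
    return result

phi = [[2, 1, 0], [0, 3, -1], [1, 0, 1]]
psi = [[1, 0, 2], [-1, 1, 0], [0, 4, 1]]
omega = [[0, 1, 1], [2, 0, -1], [1, 1, 0]]
small = [[0.2, -0.5, 0.1], [0.4, 0.3, 0.0], [-0.1, 0.2, -0.6]]

# Cyclic invariance of the scalar of a product.
print(scalar_of_product(phi, psi, omega),
      scalar_of_product(psi, omega, phi),
      scalar_of_product(omega, phi, psi))
# Determinant of the exponential against the exponential of the scalar.
print(det3(expm(small)), math.exp(trace(small)))
```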

This is of course an argumentum ad quaternionisten. I do not 
pretend that it gives the reason why I used these notations, for the 
identification of the quaternion with a matrix was, I think, unknown 
to me when I wrote my pamphlet. The real justification of the 
notations |Φ| and Φ_S is that they express functions of the linear
vector operator quâ quantity, which physicists and others have
continually occasion to use. And this justification applies to other
notations which may not have their analogues in quaternions. Thus
I have used Φ_× to express a vector so important in the theory of the
linear vector operator, that it can hardly be neglected in any treatment 
of the subject. It is described, for example, in treatises as different as 
Thomson and Tait’s Natural Philosophy and Kelland and Tait’s 
Quaternions. In the former treatise the components of the vector 
are, of course, given in terms of the elements of the linear vector 
operator, which is in accordance with the method of the treatise. In 
the latter treatise the vector is expressed by 


$$V\alpha\alpha' + V\beta\beta' + V\gamma\gamma'.$$

As this supposes the linear vector operator to be given not by a 
single letter, but by several vectors, it must be regarded as entirely 
inadequate by any one who wishes to treat the subject in the spirit of 
multiple algebra, i.e. to use a single letter to represent the linear
vector operator. 
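
The point that this vector belongs to the operator itself, and not to any particular set of vectors used to write it, can be illustrated as follows. The sketch is editorial and rests on assumptions: the operator is written as a sum of dyads a b, its matrix has elements M[j][k] = Σ a[j] b[k], and the vector of the operator is the sum of the cross products a × b. Function names and sample vectors are hypothetical, and sign conventions differ between authors.

```python
def cross(a, b):
    """Vector product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dyadic_matrix(pairs):
    """Nine elements of the operator built from dyads: M[j][k] = sum of a[j]*b[k]."""
    return [[sum(a[j] * b[k] for a, b in pairs) for k in range(3)]
            for j in range(3)]

def vector_from_dyads(pairs):
    """Sum of the vector products a x b over the constituent dyads."""
    total = (0, 0, 0)
    for a, b in pairs:
        c = cross(a, b)
        total = (total[0] + c[0], total[1] + c[1], total[2] + c[2])
    return total

def vector_from_matrix(m):
    """The same vector read directly from the nine elements of the operator."""
    return (m[1][2] - m[2][1], m[2][0] - m[0][2], m[0][1] - m[1][0])

# Three sample dyads (illustrative values only).
pairs = [((1, 0, 2), (0, 1, 1)), ((3, -1, 0), (2, 0, 1)), ((0, 1, 1), (1, 1, 0))]

print(vector_from_dyads(pairs), vector_from_matrix(dyadic_matrix(pairs)))
```

Both routes give the same vector, so a single symbol attached to the operator suffices to name it.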

But my critic does not like the notations |Φ|, Φ_S, Φ_×. His ridicule,
indeed, reaches high-water mark in the paragraphs in which he mentions
them. Concerning another notation, Φ××Φ (defined in Nature, vol.
xliii, p. 513) [this vol., p. 160], he exclaims, “Thus burden after burden,
in the form of new notation, is added apparently for the sole purpose
of exercising the faculty of memory.” He would vastly prefer, it
would appear, to write with Hamilton mφ′⁻¹, “where m represents
what the unit volume becomes under the influence of the linear
operator.” But this notation is only apparently compact, since the
m requires explanation. Moreover, if a strain were given in what
Hamilton calls the standard trinomial form, to write out the formula
for the operator on surfaces in that standard form by the use of the
expression mφ′⁻¹ would require, it seems to me, ten (if not fifty) times
the effort of memory and of ingenuity, which would be required for
the same purpose with the use of ½Φ××Φ.
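
In matrix language the operator on surfaces can be computed without inverting anything. The sketch below is an editorial illustration, assuming that the operator in question corresponds to the cofactor matrix of the strain matrix (which equals the determinant times the inverse of the transpose whenever the strain is invertible). It checks the defining property that the vector product of two strained vectors is the cofactor matrix applied to the vector product of the original vectors. Names and sample values are hypothetical.

```python
def cross(a, b):
    """Vector product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def matvec(m, v):
    """Apply a 3x3 matrix (list of rows) to a 3-vector."""
    return tuple(sum(m[i][k] * v[k] for k in range(3)) for i in range(3))

def cofactor(m):
    """Cofactor matrix of a 3x3 matrix, using cyclic indices so the signs are built in."""
    return [[m[(i + 1) % 3][(j + 1) % 3] * m[(i + 2) % 3][(j + 2) % 3]
             - m[(i + 1) % 3][(j + 2) % 3] * m[(i + 2) % 3][(j + 1) % 3]
             for j in range(3)] for i in range(3)]

# Sample strain and two sample vectors spanning a surface element.
strain = [[2, 1, 0], [0, 3, -1], [1, 0, 1]]
a, b = (1, 0, 2), (0, 1, 1)

strained_area = cross(matvec(strain, a), matvec(strain, b))
via_cofactor = matvec(cofactor(strain), cross(a, b))
print(strained_area, via_cofactor)
```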

I may here remark that Prof. Tait’s letter of endorsement of Prof.
Knott’s paper affords a striking illustration of the convenience and
flexibility of a notation entirely analogous to Φ××Φ, viz., Φ:Φ. He
gives the form S∇∇₁ Sσσ₁ to illustrate the advantage of quaternionic
notations in point of brevity. If I understand his notation, this is
what I should write ∇σ:∇σ. (I take for granted that the suffixes
indicate that ∇ applies as differential operator to σ, and ∇₁ to σ₁,
σ and σ₁ being really identical in meaning, as also ∇ and ∇₁.) It
will be observed that in my notation one dot unites in multiplication
the two ∇’s, and the other the two σ’s, and that I am able to leave
each ∇ where it naturally belongs as differential operator. The
quaternionist cannot do this, because the ∇ and σ cannot be left
together without uniting to form a quaternion, which is not at all
wanted. Moreover, I can write Φ for ∇σ, and Φ:Φ for ∇σ:∇σ. The
quaternionist also uses a φ, which is practically identical with my Φ
(viz., the operator which expresses the relation between dσ and dρ),
but I do not see how Prof. Knott, who I suppose dislikes Φ:Φ as
much as Φ××Φ, would express S∇∇₁ Sσσ₁ in terms of this φ.
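
A small numerical sketch may make the double-dot product concrete. It is an editorial illustration which assumes the rule (ab):(cd) = (a·c)(b·d), extended by linearity to sums of dyads; under that assumption the double-dot product of two operators is the sum of the products of corresponding matrix elements. Function names and sample dyads are hypothetical.

```python
def dot(a, b):
    """Scalar product of two 3-vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def dyadic_matrix(pairs):
    """Nine elements of the operator built from dyads: M[j][k] = sum of a[j]*b[k]."""
    return [[sum(a[j] * b[k] for a, b in pairs) for k in range(3)]
            for j in range(3)]

def double_dot_from_dyads(left, right):
    """Apply (ab):(cd) = (a.c)(b.d) to every pair of dyads and add the results."""
    return sum(dot(a, c) * dot(b, d) for a, b in left for c, d in right)

def double_dot_from_matrices(m, n):
    """Sum of products of corresponding elements of the two matrices."""
    return sum(m[j][k] * n[j][k] for j in range(3) for k in range(3))

# Sample operator given by three dyads (illustrative values only).
pairs = [((1, 0, 2), (0, 1, 1)), ((3, -1, 0), (2, 0, 1)), ((0, 1, 1), (1, 1, 0))]
m = dyadic_matrix(pairs)

print(double_dot_from_dyads(pairs, pairs), double_dot_from_matrices(m, m))
```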

It is characteristic of Prof. Knott’s view of the subject, that in 
translating into quaternionic from a dyadic, or operator, as he calls it, 
he adds in each case an operand. In many cases it would be difficult 
to make the translation without this. But it is often a distinct 
advantage to be able to give the operator without the operand. For 
example, in translating into quaternionic my dyadic or operator Φ×ρ,
he adds an operand, and exclaims, “The old thing!” Certainly, when
this expression is applied to an operand, there is no advantage (and
no disadvantage) in my notation as compared with the quaternionic.
But if the quaternionist wished to express what I would write in the
form (Φ×ρ)⁻¹, or |Φ×ρ|, or (Φ×ρ)_S, or (Φ×ρ)_×, he would, I think,
find the operand very much in the way.


ON DOUBLE REFRACTION AND THE DISPERSION OF
COLORS IN PERFECTLY TRANSPARENT MEDIA.


[American Journal of Science, ser. 3, vol. XXIII, pp. 262-275, April, 1882.]


1. In calculating the velocity of a system of plane waves of homogeneous
light, regarded as oscillating electrical fluxes, in transparent
and sensibly homogeneous bodies, whether singly or doubly refracting, 
we may assume that such a body is a very fine-grained structure, so 
that it can be divided into parts having their dimensions very small
in comparison with the wave-length, each of which may be regarded 
as entirely similar to every other, while in the interior of each there 
are wide differences in electrical as in other physical properties. 
Hence, the average electrical displacement in such parts of the body 
may be expressed as a function of the time and the coordinates of 
position by the ordinary equations of wave-motion, while the real
displacement at any point will in general differ greatly from that
represented by such equations.

It is the object of this paper to investigate the velocity of light in 
perfectly transparent media which have not the property of circular 
polarization in a manner which shall take account of this difference 
between the real displacements and those represented by the ordinary 
equations of wave-motion. We shall find that this difference will 
account for the dispersion of colors, without affecting the validity 
of the laws of Huyghens and Fresnel for double refraction with 
respect to light of any one color. 

In this investigation, it is assumed that the electrical displacements 
are solenoidal, or, in other words, that they are such as not to produce 
any change in electrical density. The disturbance in the medium is 
treated as consisting entirely of such electrical displacements and 
fluxes, and not complicated by any distinctively magnetic phenomena. 
It might therefore be more accurate to call the theory (as here 
developed) electrical rather than electromagnetic. The latter term is 
nevertheless retained in accordance with general usage, and with that 
of the author of the theory. 

Since the velocity which we are seeking is equal to the wave-length 
divided by the period of oscillation, the problem reduces to finding 
the ratio of these quantities, and may be simplified in some respects
by supposing that we have to do with a system of stationary waves.
That the relation of the wave-length and the period is the same for
stationary as for progressive waves is evident from the consideration 
that a system of stationary waves may be formed by two systems of 
progressive waves having opposite directions. 
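
The remark about stationary and progressive waves rests on the identity cos a cos b = ½[cos(a − b) + cos(a + b)]. The following Python sketch is an editorial illustration; the function names and the sample wave-length and period are hypothetical. It compares a stationary wave with the sum of two progressive waves of half the amplitude travelling in opposite directions.

```python
import math

def stationary(u, t, l, p, amp=1.0):
    """Stationary wave: amp * cos(2*pi*u/l) * cos(2*pi*t/p)."""
    return amp * math.cos(2 * math.pi * u / l) * math.cos(2 * math.pi * t / p)

def progressive(u, t, l, p, amp, direction):
    """Progressive wave; direction=+1 travels toward +u, direction=-1 toward -u."""
    return amp * math.cos(2 * math.pi * (u / l - direction * t / p))

def max_mismatch(l, p, samples=50):
    """Largest difference, over a grid of (u, t), between the stationary wave
    and the sum of two opposite progressive waves of half amplitude."""
    worst = 0.0
    for i in range(samples):
        for j in range(samples):
            u = 3.0 * l * i / samples
            t = 3.0 * p * j / samples
            total = (progressive(u, t, l, p, 0.5, +1)
                     + progressive(u, t, l, p, 0.5, -1))
            worst = max(worst, abs(total - stationary(u, t, l, p)))
    return worst

# The mismatch should be at the level of rounding error.
print(max_mismatch(l=0.5, p=2.0))
```

Both component waves share the same l and p, which is why the ratio l/p found for the stationary system is also the velocity of the progressive one.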

2. Let x, y, z be the rectangular coordinates of any point in the
medium, which with the system of waves we may regard as indefinitely
extended, and let ξ+ξ′, η+η′, ζ+ζ′ be the components of
electrical displacement at that point at the time t; ξ, η, ζ being the
average values of the components of electrical displacement at that
time in a wave-plane passing through the point. Then ξ, η, ζ, ξ′, η′, ζ′
are perfectly defined quantities, of which ξ, η, ζ are connected with
x, y, z, and t by the ordinary equations of wave-motion, while each
of the quantities ξ′, η′, ζ′ has always zero for its average value in any
wave-plane. We may call ξ, η, ζ the components of the regular part
of the displacement, and ξ′, η′, ζ′ the components of the irregular
part of the displacement. In like manner, the differential coefficients
of these quantities with respect to the time, ξ̇, η̇, ζ̇, ξ̇′, η̇′, ζ̇′, may be
called respectively the components of the regular part of the flux,
and the components of the irregular part of the flux.

Let the whole space be divided into elements of volume Dv, very 
small in all dimensions in comparison with a wave-length, but 
enclosing portions of the medium which may be treated as entirely 
similar to one another, and therefore not infinitely small. Thus a 
crystal may be divided into elementary parallelopipeds, all the vertices
of which are similarly situated with respect to the internal structure
of the crystal. Amorphous solids and liquids may not be capable of
division into equally small portions of which physical similarity can 
be predicated with the same rigor. Yet we may suppose them capable 
of a division substantially satisfying the requirements. 

From these definitions it follows that at any given instant the 
average value of each of the quantities ξ′, η′, ζ′ in an element Dv is
zero. For the average value in one such element must be sensibly
the same as in any other situated on the same wave-plane. If this
average were not zero, the average for the wave-plane would not be
zero. Moreover, at any given instant, the values of ξ, η, ζ may be
regarded as constant throughout any element Dv, and as representing
the average values of the components of displacement in that element.
The same will be true of the quantities ξ̇′, η̇′, ζ̇′ and ξ̇, η̇, ζ̇.

3. Since we have excluded the case of media which have the property
of circular polarization, we shall not impair the generality of
our results if we suppose that we have to do with linearly polarized
light, i.e. that the regular part of the displacement is everywhere
parallel to the same fixed line, all cases not already excluded being
reducible to this. Then, with the origin of coordinates and the zero
of time suitably chosen, the regular part of the displacement may be
represented by the equations

$$\xi = \alpha\cos 2\pi\frac{u}{l}\,\cos 2\pi\frac{t}{p}, \qquad
\eta = \beta\cos 2\pi\frac{u}{l}\,\cos 2\pi\frac{t}{p}, \qquad
\zeta = \gamma\cos 2\pi\frac{u}{l}\,\cos 2\pi\frac{t}{p}, \tag{1}$$

where l denotes the wave-length, p the period of vibration, α, β, γ
the maximum amplitudes of the displacements ξ, η, ζ, and u the
distance of the point considered from the wave-plane which passes
through the origin. Since u is a linear function of x, y, and z, we
may regard these equations as giving the values of ξ, η, ζ, for a given
system of waves, in terms of x, y, z, and t.

4. The components of the irregular displacement, ξ′, η′, ζ′, at any
given point, will evidently be simple harmonic functions of the time,
having the same period as the regular part of the displacement. That
they will also have the same phase is not quite so evident, and would
not be the case in a medium in which there were any absorption or
dispersion of light. It will however appear from the following
considerations that in perfectly transparent media the irregular
oscillations are synchronous with the regular. For if they are not
synchronous, we may resolve the irregular oscillations into two parts,
of which one shall be synchronous with the regular oscillations, and
the other shall have a difference of phase of one-fourth of a complete
oscillation. Now if the medium is one in which there is no absorption
or dispersion of light, we may assume that the same electrical
configurations may also be passed through in the inverse order, which
would be represented analytically by writing −t for t in the equations
which give ξ, η, ζ, ξ′, η′, ζ′ as functions of x, y, z, and t. But this
change would not affect the regular oscillations, nor the synchronous
part of the irregular oscillations, which depends on the cosine of the
time, while the non-synchronous part of the irregular oscillations,
which depends on the sine of the time, would simply have its
direction reversed. Hence, by taking first one-half the sum, and
secondly one-half the difference, of the original motion and that
obtained by substitution of −t for t, we may separate the
non-synchronous part of the irregular oscillations from the rest of the
motion. Therefore, the supposed non-synchronous part of the irregular
displacement, if capable of existence, is at least wholly independent
of the wave-motion and need not be considered by us.
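
The separation by half-sum and half-difference can be sketched numerically. The example is an editorial illustration with hypothetical names and amplitudes: an oscillation is built from a cosine (synchronous) part and a sine (quarter-period shifted) part, and the two parts are recovered from the motion and its time-reversed copy.

```python
import math

def make_motion(p, sync_amp, quad_amp):
    """Return a function of t with a synchronous (cosine) part and a
    non-synchronous (sine) part, both of period p."""
    w = 2 * math.pi / p
    def motion(t):
        return sync_amp * math.cos(w * t) + quad_amp * math.sin(w * t)
    return motion

def split_by_time_reversal(motion, t):
    """Half the sum and half the difference of the motion at t and at -t."""
    reversed_value = motion(-t)
    synchronous = 0.5 * (motion(t) + reversed_value)
    non_synchronous = 0.5 * (motion(t) - reversed_value)
    return synchronous, non_synchronous

p = 2.0
motion = make_motion(p, sync_amp=1.5, quad_amp=0.4)
sync, quad = split_by_time_reversal(motion, t=0.3)

# sync should equal the cosine part alone, quad the sine part alone.
print(sync, quad)
```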

We may go farther in the determination of the quantities ξ′, η′, ζ′.
For in view of the very fine-grained structure of the medium, it will
easily appear that the manner in which the general or average flux
in any element Dv (represented by ξ̇, η̇, ζ̇) distributes itself among
the molecules and intermolecular spaces must be entirely determined
by the amount and direction of that flux and its period of oscillation.
Hence, and on account of the superposable character of the motions
which we are considering, we may conclude that the values of ξ′, η′, ζ′
at any given point in the medium are capable of expression as linear
functions of ξ, η, ζ in a manner which shall be independent of the
time and of the orientation of the wave-planes and the distance of
a nodal plane from the point considered, so long as the period of
oscillation remains the same. But a change in the period may
presumably affect the relation between ξ′, η′, ζ′ and ξ, η, ζ to a certain
extent. And the relation between ξ′, η′, ζ′ and ξ, η, ζ will vary
rapidly as we pass from one point to another within the element Dv.

5. In the motion which we are considering there occur alternately 
instants of no velocity and instants of no displacement. The statical 
energy of the medium at an instant of no velocity must be equal to 
its kinetic energy at an instant of no displacement. Let us examine 
each of these quantities, and consider the equation which expresses 
their equality. 

6. Since in every part of an element Dv the irregular as well as the
regular part of the displacement is entirely determined (for light of
a given period) by the values of ξ, η, ζ, the statical energy of the
element must be a quadratic function of ξ, η, ζ, say

$$(A\xi^2 + B\eta^2 + C\zeta^2 + E\eta\zeta + F\zeta\xi + G\xi\eta)\,Dv,$$

where A, B, etc. depend only on the nature of the medium and the
period of oscillation. At an instant of no velocity, when

$$\sin 2\pi\frac{t}{p} = 0, \quad\text{and}\quad \cos^2 2\pi\frac{t}{p} = 1,$$

the above expression will reduce by equations (1) to

$$(A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta)\cos^2 2\pi\frac{u}{l}\,Dv.$$

Since the average value of cos² 2π(u/l) in an indefinitely extended
space is ½, we have for the statical energy in a unit of volume

$$S = \tfrac{1}{2}(A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta). \tag{2}$$

7. The kinetic energy of the whole medium is represented by the
double volume-integral*

$$\frac{1}{2}\sum\iint\frac{(\dot\xi+\dot\xi')_1\,(\dot\xi+\dot\xi')_2}{r}\,dv_1\,dv_2,$$

where dv₁, dv₂ are two infinitesimal elements of volume, (ξ̇+ξ̇′)₁,
(ξ̇+ξ̇′)₂ the corresponding components of flux, r the distance between
the elements, and Σ denotes a summation with respect to the
coordinate axes.

* The fluxes are supposed to be measured by the electromagnetic system of units. It
is to be observed that the difference of opinion which has prevailed with respect to the
estimation of the energy of electrical currents does not extend to such as are solenoidal,
which may be regarded as composed of closed circuits.

Separating the integrations, we may write for the same quantity

$$\frac{1}{2}\sum\int(\dot\xi+\dot\xi')_2\left[\int\frac{(\dot\xi+\dot\xi')_1}{r}\,dv_1\right]dv_2.$$

It is evident that the integral within the brackets is derived from
ξ̇+ξ̇′ by the same process by which the potential of any mass is
derived from its density. If we use the symbol Pot to express this
relation, we may write for the kinetic energy

$$\frac{1}{2}\sum\int(\dot\xi+\dot\xi')\,\mathrm{Pot}\,(\dot\xi+\dot\xi')\,dv.$$

The operation denoted by this symbol is evidently distributive,
so that Pot(ξ̇+ξ̇′) = Pot ξ̇ + Pot ξ̇′. The expression for the kinetic
energy may therefore be expanded into

$$\frac{1}{2}\sum\int\dot\xi\,\mathrm{Pot}\,\dot\xi\,dv
+ \frac{1}{2}\sum\int\dot\xi\,\mathrm{Pot}\,\dot\xi'\,dv
+ \frac{1}{2}\sum\int\dot\xi'\,\mathrm{Pot}\,\dot\xi\,dv
+ \frac{1}{2}\sum\int\dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv.$$

But ξ̇′, and therefore Pot ξ̇′, has in every wave-plane the average
value zero. Also ξ̇, and therefore Pot ξ̇, has in every wave-plane a
constant value. Therefore the second and third integrals in the
above expression will vanish, leaving for the kinetic energy

$$\frac{1}{2}\sum\int\dot\xi\,\mathrm{Pot}\,\dot\xi\,dv + \frac{1}{2}\sum\int\dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv, \tag{3}$$


which is to be calculated for a time of no displacement, when

$$\dot\xi = \pm\frac{2\pi\alpha}{p}\cos 2\pi\frac{u}{l}, \qquad
\dot\eta = \pm\frac{2\pi\beta}{p}\cos 2\pi\frac{u}{l}, \qquad
\dot\zeta = \pm\frac{2\pi\gamma}{p}\cos 2\pi\frac{u}{l}. \tag{4}$$

The form of the expression (3) indicates that the kinetic energy
consists of two parts, one of which is determined by the regular part
of the flux, and the other by the irregular part of the flux.

8. The value of Pot ξ̇ may be easily found by integration, but
perhaps more readily by Poisson’s well-known theorem, that if q is
any function of position in space (as the density of a certain mass),

$$\frac{d^2\,\mathrm{Pot}\,q}{dx^2} + \frac{d^2\,\mathrm{Pot}\,q}{dy^2} + \frac{d^2\,\mathrm{Pot}\,q}{dz^2} = -4\pi q, \tag{5}$$

where the direction of the coordinate axes is immaterial, provided
that they are rectangular. In applying this to Pot ξ̇ we may place
two of the axes in a wave-plane. This will give

$$\frac{d^2\,\mathrm{Pot}\,\dot\xi}{du^2} = -4\pi\dot\xi. \tag{6}$$

In a nodal plane, Pot ξ̇ = 0, since ξ̇ has equal positive and negative
values in elements of volume symmetrically distributed with respect
to any point in such a plane. In a wave-crest (or plane in which
ξ̇ has a maximum value), Pot ξ̇ will also have a maximum value,
which we may call K. For intermediate points we may determine
its value from the consideration that the total disturbance may be 
resolved into two systems of waves, one having a wave-crest, and the 
other a nodal plane passing through the point for which the potential 
is sought. The maximum amplitudes of these component systems 
will be to the maximum amplitude of the original system as 
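cos 2π(u/l) and sin 2π(u/l) to unity. But the second of the component
systems will contribute nothing to the value of the potential. We
thus obtain

$$\mathrm{Pot}\,\dot\xi = K\cos 2\pi\frac{u}{l},$$

$$\frac{d^2\,\mathrm{Pot}\,\dot\xi}{du^2} = -\frac{4\pi^2}{l^2}K\cos 2\pi\frac{u}{l} = -\frac{4\pi^2}{l^2}\,\mathrm{Pot}\,\dot\xi.$$

Comparing this with equation (6), we have

$$-\frac{4\pi^2}{l^2}\,\mathrm{Pot}\,\dot\xi = -4\pi\dot\xi, \qquad \mathrm{Pot}\,\dot\xi = \frac{l^2}{\pi}\,\dot\xi. \tag{7}$$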
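
Result (7) can be checked against equation (6) by finite differences. The sketch is an editorial illustration with hypothetical names and sample values: it takes a flux proportional to cos 2π(u/l), forms the potential from (7), and evaluates the residual of (6), which should vanish up to discretisation error.

```python
import math

def flux(u, l, amp):
    """Regular flux along the wave-normal: amp * cos(2*pi*u/l)."""
    return amp * math.cos(2 * math.pi * u / l)

def pot(u, l, amp):
    """Potential according to eq. (7): (l^2 / pi) times the flux."""
    return (l * l / math.pi) * flux(u, l, amp)

def second_derivative(f, u, h=1e-4):
    """Central finite-difference estimate of the second derivative of f at u."""
    return (f(u + h) - 2.0 * f(u) + f(u - h)) / (h * h)

def poisson_residual(u, l, amp):
    """Left side minus right side of eq. (6): d2 Pot/du2 + 4*pi*flux."""
    d2 = second_derivative(lambda s: pot(s, l, amp), u)
    return d2 + 4.0 * math.pi * flux(u, l, amp)

# Residuals at a few sample points; all should be close to zero.
for u in (0.0, 0.1, 0.37, 0.5):
    print(u, poisson_residual(u, l=1.0, amp=1.0))
```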


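Hence, and by equations (4),

$$\frac{1}{2}\sum\int\dot\xi\,\mathrm{Pot}\,\dot\xi\,dv = \frac{l^2}{2\pi}\sum\int\dot\xi^2\,dv = \frac{2\pi l^2}{p^2}(\alpha^2+\beta^2+\gamma^2)\int\cos^2 2\pi\frac{u}{l}\,dv.$$

The kinetic energy of the regular part of the flux is therefore, for
each unit of volume,

$$T = \frac{\pi l^2}{p^2}(\alpha^2+\beta^2+\gamma^2). \tag{8}$$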


9. With respect to the kinetic energy of the irregular part of the
flux, it is to be observed that, since ξ̇′, η̇′, ζ̇′ have their average values
zero in spaces which are very small in comparison with a wave-length,
the integrations implied in the notations Pot ξ̇′, Pot η̇′, Pot ζ̇′ may be
confined to a sphere of a radius which is small in comparison with a
wave-length. Since within such a sphere ξ̇′, η̇′, ζ̇′ are sensibly
determined by the values of ξ̇, η̇, ζ̇ at the center of the sphere, which is the
point for which the values of the potentials are sought, Pot ξ̇′, Pot η̇′,
Pot ζ̇′ must be functions, evidently linear functions, of ξ̇, η̇, ζ̇; and
ξ̇′ Pot ξ̇′, η̇′ Pot η̇′, ζ̇′ Pot ζ̇′ must be quadratic functions of the same
quantities. But these functions will vary with the position of the
point considered with reference to the adjacent molecules.

Now the expression for the kinetic energy of the irregular part
of the flux,

$$\frac{1}{2}\sum\int\dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv,$$

indicates that we may regard the infinitesimal element dv as having
the energy (due to this part of the flux)

$$\frac{1}{2}\sum\dot\xi'\,\mathrm{Pot}\,\dot\xi'\,dv.$$

Let us consider the energy due to the irregular flux which will belong
to the above defined element Dv, which is not infinitely small, but
which has the advantage of being one of physically similar elements
which make up the whole medium. The energy of this element is
found by adding the energies of all the infinitesimal elements of
which it is composed. Since these are quadratic functions of the
quantities ξ̇, η̇, ζ̇, which are sensibly constant throughout the element
Dv, the sum will be a quadratic function of ξ̇, η̇, ζ̇, say

$$(A'\dot\xi^2 + B'\dot\eta^2 + C'\dot\zeta^2 + E'\dot\eta\dot\zeta + F'\dot\zeta\dot\xi + G'\dot\xi\dot\eta)\,Dv,$$

which will therefore represent the energy of the element Dv due to
the irregular flux. The coefficients A′, B′, etc., are determined by the
nature of the medium and the period of oscillation. They will be
constant throughout the medium, since one element Dv does not differ
from another.

This expression reduces by equations (4) to

$$\frac{4\pi^2}{p^2}(A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta)\cos^2 2\pi\frac{u}{l}\,Dv.$$

The kinetic energy of the irregular flux in a unit of volume is therefore

$$T' = \frac{2\pi^2}{p^2}(A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta). \tag{9}$$
10. Equating the statical and kinetic energies, we have 
ie as 
To (a+ B+!) +24 (NO+B B+ Cy +E By +Fyat Cas), (10) 


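The velocity (V) of the corresponding system of progressive waves is
given by the equation

$$V^2 = \frac{l^2}{p^2} = \frac{1}{2\pi}\,\frac{A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta}{\alpha^2 + \beta^2 + \gamma^2} - \frac{2\pi}{p^2}\,\frac{A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta}{\alpha^2 + \beta^2 + \gamma^2}. \tag{11}$$
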
If we set

$$a = \frac{A}{2\pi} - \frac{2\pi}{p^2}A', \qquad b = \frac{B}{2\pi} - \frac{2\pi}{p^2}B', \qquad \text{etc.,} \tag{12}$$

and

$$\rho^2 = \alpha^2 + \beta^2 + \gamma^2,$$

the equation reduces to

$$V^2 = \frac{a\alpha^2 + b\beta^2 + c\gamma^2 + e\beta\gamma + f\gamma\alpha + g\alpha\beta}{\rho^2}. \tag{13}$$

For a given medium and light of a given period, the coefficients a, b,
etc., are constant. 

This relation between the velocity of the waves and the direction of 
oscillation is capable of a very simple geometrical expression. Let r 
be the radius vector of the ellipsoid 


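$$ax^2 + by^2 + cz^2 + eyz + fzx + gxy = 1. \tag{14}$$

Then

$$\frac{1}{r^2} = \frac{ax^2 + by^2 + cz^2 + eyz + fzx + gxy}{r^2}.$$

If this radius is drawn parallel to the electrical oscillations, we shall
have

$$\frac{x}{r} = \frac{\alpha}{\rho}, \qquad \frac{y}{r} = \frac{\beta}{\rho}, \qquad \frac{z}{r} = \frac{\gamma}{\rho},$$

and

$$V = \frac{1}{r}. \tag{15}$$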


That is, the wave-velocity for any particular direction of oscillation 
is represented in the ellipsoid by the reciprocal of the radius vector 
which is parallel to that direction. 
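
As an illustration of equations (13) to (15), the following minimal Python sketch evaluates the wave-velocity for a given direction of oscillation and compares it with the reciprocal of the radius vector of the ellipsoid (14). The coefficient values and the function names are assumptions chosen only for the example; they are not taken from the paper.

```python
import math

def quad_form(coef, x, y, z):
    """Evaluate a x^2 + b y^2 + c z^2 + e y z + f z x + g x y."""
    a, b, c, e, f, g = coef
    return a*x*x + b*y*y + c*z*z + e*y*z + f*z*x + g*x*y

def wave_velocity(coef, alpha, beta, gamma):
    """Equation (13): V^2 is the quadratic form divided by rho^2."""
    rho2 = alpha**2 + beta**2 + gamma**2
    return math.sqrt(quad_form(coef, alpha, beta, gamma) / rho2)

def radius_vector(coef, alpha, beta, gamma):
    """Length r of the radius of ellipsoid (14) parallel to (alpha, beta, gamma)."""
    rho = math.sqrt(alpha**2 + beta**2 + gamma**2)
    # A point r * (unit direction) lies on the ellipsoid when r^2 * Q(unit) = 1.
    q_unit = quad_form(coef, alpha / rho, beta / rho, gamma / rho)
    return 1.0 / math.sqrt(q_unit)

# Hypothetical coefficients a, b, c, e, f, g (positive-definite form).
coef = (3.0, 2.0, 1.0, 0.2, 0.1, 0.3)
V = wave_velocity(coef, 1.0, 2.0, -1.0)
r = radius_vector(coef, 1.0, 2.0, -1.0)
print(V, 1.0 / r)  # the two numbers agree, as equation (15) states
```

Only the direction of (alpha, beta, gamma) matters here, since both the numerator and the denominator of (13) are homogeneous of the second degree.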

11. This relation between the wave-length, the period, and the 
direction of vibration, must hold true not only of such vibrations as 
actually occur, but also of such as we may imagine to occur under 
the influence of constraints determining the direction of vibration in 
the wave-plane. The directions of the natural or unconstrained 
vibrations in any wave-plane may be determined by the general 
mechanical principle that if the type of a natural vibration is infini- 
tesimally altered by the application of a constraint, the value of the 
period will be stationary.* Hence, in a system of stationary waves 
such as we have been considering, if the direction of an unconstrained 
vibration is infinitesimally varied in its wave-plane by a constraint 
while the wave-length remains constant, the period will be stationary. 
Therefore, if the direction of the unconstrained vibration is infinitesi- 
mally varied by constraint, and the period remains rigorously constant, 
the wave-length will be stationary. Hence, if we make a central 
section of the above described ellipsoid parallel to any wave-plane, 
the directions of natural vibration for that wave-plane will be parallel 
to the radii vectores of stationary value in that section, viz., to the 
axes of the ellipse, when the section is elliptical, or to all radii, when 
the section is circular. 
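
The construction of § 11 can be sketched numerically: restrict the quadratic form of the ellipsoid (14) to the wave-plane and find the two directions in which the radius vector is stationary. The helper names and the sample coefficients below are assumptions made for the illustration, and the closed-form angle used is the ordinary principal-axis formula for a two-variable quadratic form rather than anything stated in the text.

```python
import math

def cross(p, q):
    return (p[1]*q[2] - p[2]*q[1], p[2]*q[0] - p[0]*q[2], p[0]*q[1] - p[1]*q[0])

def dot(p, q):
    return sum(pi * qi for pi, qi in zip(p, q))

def unit(p):
    n = math.sqrt(dot(p, p))
    return tuple(c / n for c in p)

def quad_form(coef, w):
    a, b, c, e, f, g = coef
    x, y, z = w
    return a*x*x + b*y*y + c*z*z + e*y*z + f*z*x + g*x*y

def natural_directions(coef, normal):
    """Axes of the central section of ellipsoid (14) cut by the wave-plane."""
    normal = unit(normal)
    # Any vector not parallel to the normal serves to start an in-plane basis.
    helper = (1.0, 0.0, 0.0) if abs(normal[0]) < 0.9 else (0.0, 1.0, 0.0)
    u = unit(cross(normal, helper))
    v = cross(normal, u)
    q11 = quad_form(coef, u)
    q22 = quad_form(coef, v)
    s = tuple(ui + vi for ui, vi in zip(u, v))
    q12 = 0.5 * (quad_form(coef, s) - q11 - q22)   # symmetric bilinear part
    # Stationary directions of q11 cos^2 + 2 q12 cos sin + q22 sin^2.
    # For a circular section (q12 = 0, q11 = q22) every direction is stationary.
    theta = 0.5 * math.atan2(2.0 * q12, q11 - q22)
    d1 = tuple(math.cos(theta)*ui + math.sin(theta)*vi for ui, vi in zip(u, v))
    d2 = tuple(-math.sin(theta)*ui + math.cos(theta)*vi for ui, vi in zip(u, v))
    return d1, d2

coef = (3.0, 2.0, 1.0, 0.0, 0.0, 0.0)      # hypothetical a, b, c, e, f, g
normal = (1.0, 1.0, 1.0)                    # hypothetical wave-normal
d1, d2 = natural_directions(coef, normal)
for d in (d1, d2):
    print(d, "V =", math.sqrt(quad_form(coef, d)))
```

Each returned direction is a unit vector, so by (13) the square root of the quadratic form is the corresponding wave-velocity.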

12. For light of a single period, our hypothesis has led to a 
perfectly definite result, our equations expressing the fundamental 
laws of double refraction as enunciated by Fresnel. But if we ask 
how the velocity of light varies with the period, that is, if we seek 


* See Rayleigh’s Theory of Sound, vol. i, p. 84.
to derive from the same equations the laws of the dispersion of colors,
we shall not be able to obtain an equally definite result, since the
quantities A, B, etc., and A’, B’, etc., are unknown functions of the
period. If, however, we make the assumption, which is hardly likely 
to be strictly accurate, but which may quite conceivably be not far 
removed from the truth, that the manner in which the general or 
average flux in any small part of the medium distributes itself among 
the molecules and intermolecular spaces is independent of the period, 
the quantities A, B, etc., and A’, B’, etc., will be constant, and we
obtain a very simple relation between V and p, which appears to agree 
tolerably well with the results of experiment. 


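If we set

$$H = \frac{A\alpha^2 + B\beta^2 + C\gamma^2 + E\beta\gamma + F\gamma\alpha + G\alpha\beta}{\alpha^2 + \beta^2 + \gamma^2}, \tag{16}$$

$$H' = \frac{A'\alpha^2 + B'\beta^2 + C'\gamma^2 + E'\beta\gamma + F'\gamma\alpha + G'\alpha\beta}{\alpha^2 + \beta^2 + \gamma^2}, \tag{17}$$

our general equation (11) becomes

$$V^2 = \frac{H}{2\pi} - \frac{2\pi H'}{p^2}, \tag{18}$$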


where H and H’ will be constant for any given direction of oscillation, 
when A, B, etc., and A’, B’, etc., are constant. If we wish to introduce
into the equation the absolute index of refraction (n) and the wave-length
in vacuo ($\lambda$) in place of V and p, we may divide both sides of
the equation by the square of the constant (k) representing the 
velocity of light in vacuo. Then, since 


$$n = \frac{k}{V} \quad\text{and}\quad kp = \lambda,$$

our equation reduces to

$$\frac{1}{n^2} = \frac{H}{2\pi k^2} - \frac{2\pi H'}{\lambda^2}. \tag{19}$$


It is well known that the relation between n and $\lambda$ may be tolerably
well but by no means perfectly represented by an equation of this 
form. 
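
To make the form of equation (19) concrete, the sketch below solves it for n at a few wave-lengths. The numerical values of H, H' and k are purely hypothetical (chosen so that n comes out near 1.5), and the units are arbitrary; nothing here is fitted to measured data.

```python
import math

def refractive_index(wavelength, H, H_prime, k=1.0):
    """Solve equation (19), 1/n^2 = H/(2 pi k^2) - 2 pi H'/lambda^2, for n."""
    inv_n2 = H / (2.0 * math.pi * k * k) - 2.0 * math.pi * H_prime / wavelength**2
    if inv_n2 <= 0.0:
        raise ValueError("equation (19) gives no real index for these inputs")
    return 1.0 / math.sqrt(inv_n2)

H, H_prime = 2.8, 0.0004          # hypothetical constants
for lam in (0.40, 0.50, 0.60, 0.70):
    print(lam, refractive_index(lam, H, H_prime))
```

With H' positive, the subtracted term grows as the wave-length diminishes, so the computed index rises toward the short-wave end.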

13. If we now give up the presumably inaccurate supposition that 
A, B, etc., and A’, B’, etc., are constant, equation (19) will still subsist,
but H and H’ will not be constant for a given direction of oscillation,
but will be functions of p, or, what amounts to the same, of $\lambda$.
Although we cannot therefore use the equation to derive a priori the
relation between n and $\lambda$, we may use it to derive the values of H
and H’ from the empirically determined relation between n and $\lambda$.
To do this, we must make use again of the general principle that an
infinitesimal variation in the type of a vibration, due to a constraint,
will not affect the period. If we first consider a certain system of
stationary waves, then a system in which the wave-length is greater
by an infinitesimal dl (the direction of oscillation remaining the same),
the period will be increased by an infinitesimal dp, and the manner in 
which the flux distributes itself among the molecules and intermole- 
cular spaces will presumably be infinitesimally changed. But if we 
suppose that in the second system of waves there is applied a con- 
straint compelling the flux to distribute itself in the same way among 
the molecules and intermolecular spaces as in the first system (so that
$\xi'$, $\eta'$, $\zeta'$ shall be the same functions as before of $\xi$, $\eta$, $\zeta$, a supposition
perfectly compatible with the fact that the values of $\xi$, $\eta$, $\zeta$ are
changed), this constraint, according to the principle cited, will not
affect the period of oscillation. Our equations will apply to such a
constrained type of oscillation, and A, B, etc., and A’, B’, etc., and
therefore H and H’, will have the same values in the last described
system of waves as in the first system, although the wave-length and
the period have been varied. Therefore, in differentiating equation
(18), which is essentially an equation between l and p, or its equivalent
(19), we may treat H and H’ as constant. This gives


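$$-\frac{2}{n^3}\frac{dn}{d\lambda} = \frac{4\pi H'}{\lambda^3}.$$

We thus obtain the values of H’ and H

$$H' = -\frac{\lambda^3}{2\pi n^3}\frac{dn}{d\lambda}, \qquad H = \frac{2\pi k^2}{n^2} - \frac{2\pi k^2\lambda}{n^3}\frac{dn}{d\lambda}. \tag{20}$$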
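
A round-trip check of equations (19) and (20) may be sketched as follows: choose H and H', compute n and its slope from (19), and recover the two constants with (20). The constants, the step size, and the function names are assumptions for the illustration, and the slope is taken by a central difference instead of analytically so that the check is not circular.

```python
import math

def refractive_index(lam, H, H_prime, k=1.0):
    """Equation (19) solved for n (assumes the right-hand side is positive)."""
    return 1.0 / math.sqrt(H / (2.0*math.pi*k*k) - 2.0*math.pi*H_prime / lam**2)

def recover_H(n, dn_dlam, lam, k=1.0):
    """Equations (20): H and H' from n and its slope dn/d(lambda)."""
    H_prime = -lam**3 * dn_dlam / (2.0 * math.pi * n**3)
    H = 2.0*math.pi*k*k / n**2 - 2.0*math.pi*k*k*lam*dn_dlam / n**3
    return H, H_prime

H_true, Hp_true = 2.8, 0.0004     # hypothetical constants
lam, h = 0.55, 1e-6               # sample wave-length and difference step
n = refractive_index(lam, H_true, Hp_true)
slope = (refractive_index(lam + h, H_true, Hp_true)
         - refractive_index(lam - h, H_true, Hp_true)) / (2.0 * h)
H_est, Hp_est = recover_H(n, slope, lam)
print(H_est, Hp_est)
```

In practice n and its slope would come from an empirical dispersion curve, which is the use the text has in view.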


By determining the values of H and H’ for different directions of 
oscillation, we may determine the values of A, B, etc., and A’, B’, etc.

By means of these equations, the ratios of the statical energy (S), 
the kinetic energy due to the regular part of the flux (T), and the 
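kinetic energy due to the irregular part of the flux (T’), are easily
obtained in a form which admits of experimental determination.
Equations (8) and (9) give

$$\frac{T'}{T} = \frac{2\pi H'}{l^2}.$$

Therefore, by (20),

$$\frac{T'}{T} = \frac{2\pi H'}{l^2} = \frac{2\pi H' n^2}{\lambda^2} = -\frac{\lambda}{n}\frac{dn}{d\lambda} = -\frac{d\log n}{d\log\lambda}, \tag{21}$$

$$\frac{S}{T} = \frac{d\log\lambda - d\log n}{d\log\lambda} = \frac{d\log l}{d\log\lambda}, \tag{22}$$

$$\frac{T'}{S} = -\frac{d\log n}{d\log l}. \tag{23}$$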




Since S, T, and T’ are essentially positive quantities, their ratios
must be positive. Equation (21) therefore requires that the index 
of refraction shall increase as the period or wave-length in vacuo 
diminishes. Experiment has shown no exceptions to this rule, except 
such as are manifestly attributable to the absorption of light. 
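
The energy ratios (21) to (23) depend only on the logarithmic slope of the dispersion curve, which the following sketch makes explicit. The sample values of n, of its slope, and of the wave-length are hypothetical; the relation S = T + T' used in the comment is the energy balance of § 10.

```python
def energy_ratios(n, dn_dlam, lam):
    """Return (T'/T, S/T, T'/S) from equations (21), (22), (23)."""
    dlogn_dloglam = lam * dn_dlam / n
    tp_over_t = -dlogn_dloglam            # equation (21)
    s_over_t = 1.0 - dlogn_dloglam        # equation (22), i.e. S = T + T'
    tp_over_s = tp_over_t / s_over_t      # equation (23), since d log l = d log lambda - d log n
    return tp_over_t, s_over_t, tp_over_s

# Hypothetical normal dispersion: the index falls as the wave-length grows.
tp_over_t, s_over_t, tp_over_s = energy_ratios(n=1.5, dn_dlam=-0.05, lam=0.5)
print(tp_over_t, s_over_t, tp_over_s)
```

A positive slope dn/dλ would make T'/T negative, which is the case the text excludes for perfectly transparent media.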

14. It remains to consider the relations between the optical pro- 
perties of a medium and the planes or axes of symmetry which it 
may possess. If we consider the statical energy per unit of volume 
(S) and the period as constant, we may regard equation (2) as the 
equation of an ellipsoid, the radii vectores of which represent in 
direction and magnitude the amplitudes of systems of waves having 
the same statical energy. In like manner, if we consider the kinetic 
energy of the irregular part of the flux per unit of volume (T’) and
the period as constant, we may regard equation (9) as the equation 
of an ellipsoid, the radii vectores of which represent in direction and 
magnitude the amplitudes of systems of waves having the same 
kinetic energy due to the irregular part of the flux. These ellipsoids, 
which we may distinguish as the ellipsoids (A, B, etc.) and (A’, B’, etc.),
as well as the ellipsoid before described, which we may call the
ellipsoid (a, b, etc.), must be independent in their form and their
orientation of the directions of the axes of coordinates, being deter- 
mined entirely by the nature of the medium and the period of 
oscillation. They must therefore possess the same kind of symmetry 
as the internal structure of the medium. 

If the medium is symmetrical about a certain axis, each ellipsoid 
must have an axis parallel to that. If the medium is symmetrical 
with respect to a certain plane, each ellipsoid must have an axis at 
right angles to that plane. If the medium after a revolution of less 
than 180° about a certain axis is then equivalent to the medium in 
its first position, or symmetrical with it with respect to a plane at 
right angles to that axis, each ellipsoid must have an axis of revolution 
parallel to that axis. These relations must be the same for light of 
all colors, and also for all temperatures of the medium. 

15. From these principles we may infer the optical characteristics 
of the different crystallographic systems. 

In crystals of the isometric system, as in amorphous bodies, the 
three ellipsoids reduce to spheres. Such media are optically isotropic 
at least so far as any properties are concerned which come within the 
scope of this paper. 

In crystals of the tetragonal or hexagonal systems, the three 
ellipsoids will have axes of rotation parallel to the principal
crystallographic axis. Since the ellipsoid (a, b, etc.) has but one circular
section, there will be but one optic axis, which will have a fixed
direction.

In crystals of the orthorhombic system, the three ellipsoids will
have their axes parallel to the rectangular crystallographic axes. If
we take these directions for the axes of coordinates, E, F, G, E’, F’, G’,
e, f, g will vanish and equation (13) will reduce to

$$V^2 = \frac{a\alpha^2 + b\beta^2 + c\gamma^2}{\rho^2}.$$

If the coordinate axes are so placed that

$$a > b > c,$$

the optic axes will lie in the X-Z plane, making equal angles with
the axis of Z, which may be determined by the equation

$$\tan^2\phi = \frac{a-b}{b-c} = \frac{p^2(A-B) - 4\pi^2(A'-B')}{p^2(B-C) - 4\pi^2(B'-C')}.$$

To get a rough idea of the manner in which $\phi$ varies with the period,
we may regard A, B, C, A’, B’, C’ as constant in this equation.

But since the lengths of the axes of the ellipsoid (a, b, etc.) vary
with the period, it may easily happen that the order of the axes
with respect to magnitude is not the same for all colors. In that
case, the optic axes for certain colors will lie in one of the principal
planes, and for other colors in another. For the color at which the
change takes place, the two optic axes will coincide. The differential
coefficient $d\phi/dp$ becomes infinitely great as the optic axes approach
coincidence.
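
The formula for the optic-axis angle can be checked against its geometrical meaning: the central section of the ellipsoid (a, b, etc.) perpendicular to an optic axis should be circular. The sketch below does this for a hypothetical orthorhombic case with e, f, g equal to zero; the numerical coefficients and names are assumptions for the example.

```python
import math

def optic_axis_angle(a, b, c):
    """Angle phi between an optic axis and the axis of Z, from tan^2(phi) = (a-b)/(b-c)."""
    if not (a > b > c):
        raise ValueError("the formula assumes a > b > c")
    return math.atan(math.sqrt((a - b) / (b - c)))

a, b, c = 3.0, 2.0, 1.0                      # hypothetical coefficients
phi = optic_axis_angle(a, b, c)

# The optic axis is (sin(phi), 0, cos(phi)). The plane perpendicular to it
# contains the axis of Y and the X-Z direction (cos(phi), 0, -sin(phi)).
in_plane = (math.cos(phi), 0.0, -math.sin(phi))
q_in_plane = a * in_plane[0]**2 + c * in_plane[2]**2
q_along_y = b
print(math.degrees(phi), q_in_plane, q_along_y)
```

Because the form is diagonal, the two in-plane directions used here are already principal directions of the section, so equal values of the quadratic form along them are enough to show the section is a circle.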

In crystals of the monoclinic system, each of the three ellipsoids
will have an axis perpendicular to the plane of symmetry. We may
choose this direction for the axis of X. Then F, G, F’, G’, f, g will
vanish and equation (13) will reduce to

$$V^2 = \frac{a\alpha^2 + b\beta^2 + c\gamma^2 + e\beta\gamma}{\rho^2}.$$

The angle $\theta$ made by one of the axes of the ellipsoid (a, b, etc.) in
the plane of symmetry with the axis of Y and measured toward the
axis of Z, is determined by the equation

$$\tan 2\theta = \frac{e}{c-b} = \frac{p^2 E - 4\pi^2 E'}{p^2(C-B) - 4\pi^2(C'-B')}.$$

To get a rough idea of the dispersion of the axes of the ellipsoid
(a, b, etc.) in the plane of symmetry, we may regard B, C, E, B’, C’, E’
as constant in this equation, and suppose the axis of Y so placed as
to make E vanish.

It is evident that in this system the plane of the optic axes will be
fixed, or will rotate about one of the lines which bisect the angles
made by the optic axes, according as the mean axis of the ellipsoid
(a, b, etc.) is perpendicular to the plane of symmetry or lies in that
plane. In the first case the dispersion of the two optic axes will be 
unequal. The same crystal, however, with light of different colors, 
or at different temperatures, may afford an example of each case. 

In crystals of the triclinic system, since the ellipsoids (A, B, etc.)
and (A’, B’, etc.) are determined by considerations of a different
nature, and there are no relations of symmetry to cause a coincidence
in the directions of their axes, there will not in general be any such
coincidence. Therefore the three axes of the ellipsoid (a, b, etc.), that
is, the two lines which bisect the angles of the optic axes and their 
common normal, will vary in position with the color of the light. 

16. It appears from the foregoing discussion that by the electromagnetic
theory of light we may not only account for the dispersion
of colors (including the dispersion of the lines which bisect the angles 
of the optic axes in doubly refracting media), but may also obtain 
Fresnel’s laws of double refraction for every kind of homogeneous 
light without neglect of the quantities which determine the dispersion 
of colors. 

But a closer approximation than that of this paper will be neces- 
sary to explain the phenomena of circularly polarizing media, which 
depend on very minute differences of wave-velocity, represented 
perhaps by a few units in the sixth significant figure of the index 
of refraction. That the degree of approximation which will give the 
laws of circular and elliptic polarization will not add any terms to 
the equations of this paper, except such as vanish for media which 
do not exhibit this phenomenon, will be shown in another number 
of this Journal. 


XII. 


ON DOUBLE REFRACTION IN PERFECTLY TRANSPARENT 
MEDIA WHICH EXHIBIT THE PHENOMENA OF CIR- 
CULAR POLARIZATION. 


[American Journal of Science, ser. 3, vol. XXIII, pp. 460-476, June, 1882.]


1. In the April number of this Journal,* the velocity of propagation 
of a system of plane waves of light, regarded as oscillating electrical 
fluxes, was discussed with such a degree of approximation as would 
account for the dispersion of colors and give Fresnel’s laws of double 
refraction. It is the object of this paper to supplement that discussion 
by carrying the approximation so much further as is necessary in 
order to embrace the phenomena of circularly polarizing media. 

2. If we imagine all the velocities in any progressive system of 
plane waves to be reversed at a given instant without affecting the 
displacements, and the system of wave-motion thus obtained to be 
superposed upon the original system, we obtain a system of stationary 
waves having the same wave-length and period of oscillation as the 
original progressive system. If we then reduce the magnitude of 
the displacements in the uniform ratio of two to one, they will be 
identical, at an instant of maximum displacement, with those of the
original system at the same instant. 

Following the same method as in the paper cited, let us especially 
consider the system of stationary waves, and divide the whole displacement
into the regular part, represented by $\xi$, $\eta$, $\zeta$, and the
irregular part, represented by $\xi'$, $\eta'$, $\zeta'$, in accordance with the
definitions of § 2 of that paper. 

3. The regular part of the displacement is subject to the equations 
of wave-motion, which may be written (in the most general case 
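of plane stationary waves)

$$\begin{aligned}
\xi &= \left(\alpha_1\cos 2\pi\frac{u}{l} + \alpha_2\sin 2\pi\frac{u}{l}\right)\cos 2\pi\frac{t}{p},\\
\eta &= \left(\beta_1\cos 2\pi\frac{u}{l} + \beta_2\sin 2\pi\frac{u}{l}\right)\cos 2\pi\frac{t}{p},\\
\zeta &= \left(\gamma_1\cos 2\pi\frac{u}{l} + \gamma_2\sin 2\pi\frac{u}{l}\right)\cos 2\pi\frac{t}{p},
\end{aligned} \tag{1}$$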


* See page 182 of this volume.

where $l$ denotes the wave-length, $p$ the period of oscillation, $u$ the
distance of the point considered from the wave-plane passing through
the origin, $\alpha_1$, $\beta_1$, $\gamma_1$ the amplitudes of the displacements $\xi$, $\eta$, $\zeta$ in
the wave-plane passing through the origin, and $\alpha_2$, $\beta_2$, $\gamma_2$ their
amplitudes in a wave-plane one-quarter of a wave-length distant
and on the side toward which $u$ increases. If we also write L, M, N
for the direction-cosines of the wave-normal drawn in the direction
in which $u$ increases, we shall have the following necessary relations:

$$L^2 + M^2 + N^2 = 1, \tag{2}$$

$$u = Lx + My + Nz, \tag{3}$$

$$L\alpha_1 + M\beta_1 + N\gamma_1 = 0, \qquad L\alpha_2 + M\beta_2 + N\gamma_2 = 0. \tag{4}$$
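
Equations (1) to (4) can be read as a recipe for the regular displacement at any point and time. The sketch below evaluates them for a hypothetical wave travelling along Z with elliptical displacements; the function name, the amplitudes, and the values of l and p are assumptions for the illustration.

```python
import math

def displacement(point, t, normal, amp1, amp2, l, p):
    """Regular displacement (xi, eta, zeta) of equations (1).

    normal = (L, M, N); amp1 = (alpha1, beta1, gamma1); amp2 = (alpha2, beta2, gamma2).
    """
    L, M, N = normal
    x, y, z = point
    u = L*x + M*y + N*z                          # equation (3)
    c = math.cos(2.0 * math.pi * u / l)
    s = math.sin(2.0 * math.pi * u / l)
    time_factor = math.cos(2.0 * math.pi * t / p)
    return tuple((a1*c + a2*s) * time_factor for a1, a2 in zip(amp1, amp2))

normal = (0.0, 0.0, 1.0)        # satisfies equation (2)
amp1 = (1.0, 0.0, 0.0)          # both amplitudes satisfy equations (4)
amp2 = (0.0, 0.5, 0.0)
l, p = 2.0, 1.0                 # hypothetical wave-length and period

d = displacement((0.3, -0.2, 0.7), 0.1, normal, amp1, amp2, l, p)
print(d)
```

Since both amplitude vectors are perpendicular to the wave-normal, the displacement is transverse at every point, which is what relations (4) secure.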


4. That the irregular part of the displacement ($\xi'$, $\eta'$, $\zeta'$) at any
given point is a simple harmonic function of the time, having the
same period and phase as the regular part of the displacement ($\xi$, $\eta$, $\zeta$),
may be proved by the single principle of superposition of motions,
and is therefore to be regarded as exact in a discussion of this kind.
But the further conclusion of the preceding paper (§ 4), “that the
values of $\xi'$, $\eta'$, $\zeta'$ at any given point in the medium are capable of
expression as linear functions of $\xi$, $\eta$, $\zeta$ in a manner which shall be
independent of the time and of the orientation of the wave-planes and
the distance of a nodal plane from the point considered, so long as the
period of oscillation remains the same,” is evidently only approximative,
although a very close approximation. A very much closer
approximation may be obtained, if we regard $\xi'$, $\eta'$, $\zeta'$, at any given
point of the medium and for light of a given period, as linear functions
of $\xi$, $\eta$, $\zeta$ and the nine differential coefficients

$$\frac{d\xi}{dx}, \quad \frac{d\eta}{dx}, \quad \frac{d\zeta}{dx}, \quad \frac{d\xi}{dy}, \quad \text{etc.}$$

We shall write $\xi$, $\eta$, $\zeta$ and diff. coeff. to denote these twelve quantities.

From this it follows immediately that with the same degree of
approximation $\dot\xi'$, $\dot\eta'$, $\dot\zeta'$ may be regarded, for a given point of the
medium and light of a given period, as linear functions of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and
the differential coefficients of $\dot\xi$, $\dot\eta$, $\dot\zeta$ with respect to the coordinates.
For these twelve quantities we shall write $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff.

5. Let us now proceed to equate the statical energy of the medium 
at an instant of no velocity with its kinetic energy at an instant of 
no displacement. It will be convenient to estimate each of these 
quantities for a unit of volume. 

6. The statical energy of an infinitesimal element of volume may be 
represented by $\sigma\,dv$, where $\sigma$ is a quadratic function of the components
of displacement $\xi+\xi'$, $\eta+\eta'$, $\zeta+\zeta'$. Since for that element of volume
$\xi'$, $\eta'$, $\zeta'$ may be regarded as linear functions of $\xi$, $\eta$, $\zeta$ and diff. coeff.,
we may regard $\sigma$ as a quadratic function of $\xi$, $\eta$, $\zeta$ and diff. coeff., or
as a linear function of the seventy-eight squares and products of these
quantities. But the seventy-eight coefficients by which this function 
is expressed will vary with the position of the element of volume with 
respect to the surrounding molecules. 

In estimating the statical energy for any considerable space by the 
integral 

$$\int \sigma\,dv,$$

it will be allowable to substitute for the seventy-eight coefficients
contained implicitly in $\sigma$ their average values throughout the medium.
That is, if we write $s$ for a quadratic function of $\xi$, $\eta$, $\zeta$ and diff. coeff.
in which the seventy-eight coefficients are the space-averages of those
in $\sigma$, the statical energy of any considerable space may be estimated
by the integral

$$\int s\,dv.$$


(This will appear most distinctly if we suppose the integration to 
be first effected for a thin slice of the medium bounded by two 
wave-planes.) The seventy-eight coefficients of this function s are 
determined solely by the nature of the medium and the period of 
oscillation. 

We may divide $s$ into three parts, of which the first ($s_{\prime}$) contains the
squares and products of $\xi$, $\eta$, $\zeta$, the second ($s_{\prime\prime}$) contains the products of
$\xi$, $\eta$, $\zeta$ with the differential coefficients, and the third ($s_{\prime\prime\prime}$) contains the
squares and products of the differential coefficients. It is evident that
the average statical energy of the whole medium per unit of volume
is the space-average of $s$, and that it will consist of three parts, which
are the space-averages of $s_{\prime}$, $s_{\prime\prime}$, and $s_{\prime\prime\prime}$, respectively. These parts we
may call $S_{\prime}$, $S_{\prime\prime}$, and $S_{\prime\prime\prime}$. Only the first of these was considered in
the preceding paper.
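
The counts quoted in this section (seventy-eight coefficients in all, and the twenty-seven mixed products taken up in § 8) follow from simple enumeration of the squares and products of twelve quantities. The short sketch below performs that enumeration; the string labels are merely illustrative names for the twelve quantities.

```python
from itertools import combinations_with_replacement

components = ["xi", "eta", "zeta"]
derivatives = [f"d{c}/d{v}" for c in components for v in "xyz"]
quantities = components + derivatives            # the twelve quantities

# Every square or product of two of the twelve quantities, order ignored.
terms = list(combinations_with_replacement(quantities, 2))

def derivative_count(term):
    """How many factors of the term are differential coefficients (0, 1 or 2)."""
    return sum(q in derivatives for q in term)

counts = [sum(1 for t in terms if derivative_count(t) == k) for k in range(3)]
print(len(terms), counts)   # 78 [6, 27, 45]
```

The three counts correspond to the three parts into which the function s is divided: six terms free of differential coefficients, twenty-seven mixed products, and forty-five squares and products of the differential coefficients.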

Now the considerations which justify us in neglecting, for an
approximate estimate, the terms of $s$ which contain the differential
coefficients of $\xi$, $\eta$, $\zeta$ with respect to the coordinates, will apply with
especial force to the terms which contain the squares and products of
these differential coefficients. Therefore, to carry the approximation
one step beyond that of the preceding paper, it will only be necessary
to take account of $s_{\prime}$ and $s_{\prime\prime}$, and of $S_{\prime}$ and $S_{\prime\prime}$.

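7. We may set

$$s_{\prime} = A\xi^2 + B\eta^2 + C\zeta^2 + E\eta\zeta + F\zeta\xi + G\xi\eta, \tag{5}$$

where, for a given medium and light of a given period, A, B, C, E, F,
G are constant.

Since the average values of

$$\sin^2 2\pi\frac{u}{l}, \qquad \cos^2 2\pi\frac{u}{l}, \qquad \sin 2\pi\frac{u}{l}\,\cos 2\pi\frac{u}{l}$$

are respectively $\tfrac{1}{2}$, $\tfrac{1}{2}$, and 0, and since at the time to be considered

$$\cos^2 2\pi\frac{t}{p} = 1,$$

it will appear from inspection of equations (1) that

$$\begin{aligned}
S_{\prime} = {} & \tfrac{1}{2}(A\alpha_1^2 + B\beta_1^2 + C\gamma_1^2 + E\beta_1\gamma_1 + F\gamma_1\alpha_1 + G\alpha_1\beta_1)\\
& + \tfrac{1}{2}(A\alpha_2^2 + B\beta_2^2 + C\gamma_2^2 + E\beta_2\gamma_2 + F\gamma_2\alpha_2 + G\alpha_2\beta_2).
\end{aligned} \tag{6}$$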


This is the first part of the statical energy of the whole medium 
per unit of volume. 

8. The second part of the statical energy of the whole medium
per unit of volume ($S_{\prime\prime}$) is the space-average of $s_{\prime\prime}$, which is a linear
function of the twenty-seven products of $\xi$, $\eta$, $\zeta$ with their differential
coefficients with respect to the coordinates. Now since

$$\xi\frac{d\xi}{dx} = \frac{1}{2}\frac{d(\xi^2)}{dx},$$

the space-average of such products will be zero, and they will
contribute nothing to the value of $S_{\prime\prime}$. There will be nine of these
products, in which the same component of displacement appears
twice. The remaining eighteen products may be divided into pairs
according to the letters which they contain, as

$$\eta\frac{d\zeta}{dx} \quad\text{and}\quad \zeta\frac{d\eta}{dx}, \quad\text{etc.}$$

A linear function of the eighteen products may also be regarded
as a linear function of the sums and differences of the products in
such pairs. But since

$$\eta\frac{d\zeta}{dx} + \zeta\frac{d\eta}{dx} = \frac{d(\eta\zeta)}{dx},$$

the terms of $s_{\prime\prime}$ containing such sums will contribute nothing to the
value of $S_{\prime\prime}$. We have left a linear function of the nine differences

$$\eta\frac{d\zeta}{dx} - \zeta\frac{d\eta}{dx}, \qquad \zeta\frac{d\xi}{dx} - \xi\frac{d\zeta}{dx}, \qquad \xi\frac{d\eta}{dx} - \eta\frac{d\xi}{dx}, \qquad\text{etc.}$$

(the unwritten expressions being obtained by substituting in the
denominators dy and dz for dx), which constitutes the part of $s_{\prime\prime}$
that we have to consider. $S_{\prime\prime}$ is therefore a linear function of the
space-averages of these nine quantities. But by (3)

$$\eta\frac{d\zeta}{dx} - \zeta\frac{d\eta}{dx} = L\left(\eta\frac{d\zeta}{du} - \zeta\frac{d\eta}{du}\right),$$

and the space-average of this, at a moment of maximum displacement,
is by (1)

$$\frac{2\pi L}{l}(\beta_1\gamma_2 - \gamma_1\beta_2).$$

By such reductions it appears that $lS_{\prime\prime}$ is a linear function of the
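nine products of L, M, N with

$$\beta_1\gamma_2 - \gamma_1\beta_2, \qquad \gamma_1\alpha_2 - \alpha_1\gamma_2, \qquad \alpha_1\beta_2 - \beta_1\alpha_2.$$

Now if we set

$$\Theta = L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2), \tag{7}$$

we have by (4) and (2)

$$L\Theta = \beta_1\gamma_2 - \gamma_1\beta_2, \qquad M\Theta = \gamma_1\alpha_2 - \alpha_1\gamma_2, \qquad N\Theta = \alpha_1\beta_2 - \beta_1\alpha_2. \tag{8}$$

Therefore $lS_{\prime\prime}$ is a linear function of the nine products of L, M, N
with $L\Theta$, $M\Theta$, $N\Theta$. That is, $lS_{\prime\prime}$ is the product of $\Theta$ and a quadratic
function of L, M and N. We may therefore write

$$S_{\prime\prime} = \frac{\Phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right], \tag{9}$$

where $\Phi$ is a quadratic function of L, M and N, dependent, however,
on the nature of the medium and the period of oscillation.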
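
Two steps of § 8 lend themselves to a quick numerical check: the space-average of one of the nine differences, and the relations (8) that follow from (2) and (4). The sketch below does both for a hypothetical wave-normal and pair of amplitude vectors chosen to satisfy (2) and (4); all names and numbers are assumptions for the illustration.

```python
import math

def cross(p, q):
    return (p[1]*q[2] - p[2]*q[1], p[2]*q[0] - p[0]*q[2], p[0]*q[1] - p[1]*q[0])

def dot(p, q):
    return sum(pi * qi for pi, qi in zip(p, q))

def theta(normal, rho1, rho2):
    """Equation (7): Theta = L(b1 g2 - g1 b2) + M(g1 a2 - a1 g2) + N(a1 b2 - b1 a2)."""
    return dot(normal, cross(rho1, rho2))

def mean_eta_zeta_difference(normal, rho1, rho2, l, samples=360):
    """Space-average over one wave-length of eta*dzeta/dx - zeta*deta/dx at maximum displacement."""
    L = normal[0]
    w = 2.0 * math.pi / l
    total = 0.0
    for i in range(samples):
        u = l * i / samples
        c, s = math.cos(w * u), math.sin(w * u)
        eta = rho1[1]*c + rho2[1]*s
        zeta = rho1[2]*c + rho2[2]*s
        deta_du = w * (-rho1[1]*s + rho2[1]*c)
        dzeta_du = w * (-rho1[2]*s + rho2[2]*c)
        total += L * (eta*dzeta_du - zeta*deta_du)   # d/dx = L d/du, by (3)
    return total / samples

normal = (1.0/3.0, 2.0/3.0, 2.0/3.0)   # L, M, N with L^2 + M^2 + N^2 = 1
rho1 = (2.0, -1.0, 0.0)                 # alpha1, beta1, gamma1, perpendicular to the normal
rho2 = (2.0, 2.0, -3.0)                 # alpha2, beta2, gamma2, perpendicular to the normal
l = 1.5                                 # hypothetical wave-length

Theta = theta(normal, rho1, rho2)
lhs = cross(rho1, rho2)                           # right-hand members of (8)
rhs = tuple(n * Theta for n in normal)            # L*Theta, M*Theta, N*Theta
avg = mean_eta_zeta_difference(normal, rho1, rho2, l)
expected = 2.0 * math.pi * normal[0] / l * (rho1[1]*rho2[2] - rho1[2]*rho2[1])
print(Theta, lhs, rhs, avg, expected)
```

The integrand in the average turns out to be the same at every point of the wave, so the number of samples has no influence on the result.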

9. It will be useful to consider more closely the geometrical 
significance of the quantity $\Theta$. For this purpose it will be convenient
to have a definite understanding with respect to the relative position 
of the coordinate axes. 

We shall suppose that the axes of X, Y, and Z are related in the 
same way as lines drawn to the right, forward and upward, so that 
a rotation from X to Y appears clockwise to one looking in the 
direction of Z. 

Now if from any same point, as the origin of coordinates, we lay 
off lines representing in direction and magnitude the displacements 
in all the different wave-planes, we obtain an ellipse, which we may 
call the displacement-ellipse.* Of this, one radius vector ($\rho_1$) will
have the components $\alpha_1$, $\beta_1$, $\gamma_1$, and another ($\rho_2$) the components
$\alpha_2$, $\beta_2$, $\gamma_2$. These will belong to conjugate diameters, each being
parallel to the tangent at the extremity of the other. The area of
the ellipse will therefore be equal to the parallelogram of which
$\rho_1$ and $\rho_2$ are two sides, multiplied by $\pi$. Now it is evident that
$\beta_1\gamma_2 - \gamma_1\beta_2$, $\gamma_1\alpha_2 - \alpha_1\gamma_2$, $\alpha_1\beta_2 - \beta_1\alpha_2$ are numerically equal to the
projections of this parallelogram on the planes of the coordinate axes,
and are each positive or negative according as a revolution from
$\rho_1$ to $\rho_2$ appears clockwise or counter-clockwise to one looking in
the direction of the proper coordinate axis. Hence, $\Theta$ will be
numerically equal to the parallelogram, that is, to the area of the
displacement-ellipse divided by $\pi$, and will be positive or negative


*This ellipse, which represents the simultaneous displacements in different parts of 
the field, will also represent the successive displacements at any same point in the 
corresponding system of progressive waves.

according as a revolution from $\rho_1$ to $\rho_2$ appears clockwise or counter-clockwise
to one looking in the direction of the wave-normal. Since
$\rho_1$ and $\rho_2$ are determined by displacements in planes one-quarter of
a wave-length distant from each other, and the plane to which the
latter relates lies on the side toward which the wave-normal is drawn,
it follows that $\Theta$ is positive or negative according as the combination
of displacements has the character of a right-handed or a left-handed
screw.
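
The statement that the area of the displacement-ellipse equals π times the magnitude of Θ can be checked by tracing the ellipse as a many-sided polygon. In the sketch below the vectors, the step count, and the helper names are assumptions for the illustration; the second example uses a wave-normal along Z with the two amplitude vectors along X and Y.

```python
import math

def cross(p, q):
    return (p[1]*q[2] - p[2]*q[1], p[2]*q[0] - p[0]*q[2], p[0]*q[1] - p[1]*q[0])

def dot(p, q):
    return sum(pi * qi for pi, qi in zip(p, q))

def norm(p):
    return math.sqrt(dot(p, p))

def theta(normal, rho1, rho2):
    """Equation (7) of the text, written as normal . (rho1 x rho2)."""
    return dot(normal, cross(rho1, rho2))

def ellipse_area(rho1, rho2, steps=20000):
    """Polygon approximation to the area enclosed by rho1*cos(s) + rho2*sin(s)."""
    area = 0.0
    prev = rho1
    for i in range(1, steps + 1):
        s = 2.0 * math.pi * i / steps
        cur = tuple(a*math.cos(s) + b*math.sin(s) for a, b in zip(rho1, rho2))
        area += 0.5 * norm(cross(prev, cur))   # triangle (origin, prev, cur)
        prev = cur
    return area

normal = (1.0/3.0, 2.0/3.0, 2.0/3.0)
rho1 = (2.0, -1.0, 0.0)
rho2 = (2.0, 2.0, -3.0)
area = ellipse_area(rho1, rho2)
print(area, math.pi * abs(theta(normal, rho1, rho2)))

# Sign of Theta for displacements turning from X toward Y as u increases along Z.
theta_xy = theta((0.0, 0.0, 1.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
print(theta_xy)
```

In the second example Θ is positive; the displacement there follows the curve (cos s, sin s) as one advances along Z, which is a right-handed helix, in agreement with the sign rule of the text.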

10. The kinetic energy of the medium, which is to be estimated 
for an instant of no displacement, may be shown as in §7 of the 
former paper (page 185 of this volume) to consist of two parts, of 
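which one relates to the regular flux ($\dot\xi$, $\dot\eta$, $\dot\zeta$), and the other to the
irregular flux ($\dot\xi'$, $\dot\eta'$, $\dot\zeta'$). The first, in the notation of that paper, is
represented by

$$\tfrac{1}{2}\int(\dot\xi\,\mathrm{Pot}\,\dot\xi + \dot\eta\,\mathrm{Pot}\,\dot\eta + \dot\zeta\,\mathrm{Pot}\,\dot\zeta)\,dv,$$

which reduces to

$$\frac{l^2}{2\pi}\int(\dot\xi^2 + \dot\eta^2 + \dot\zeta^2)\,dv.$$

By substitution of the values given by equations (1), we obtain for
the kinetic energy due to the regular flux in a unit of volume

$$T = \frac{\pi l^2}{p^2}(\alpha_1^2 + \beta_1^2 + \gamma_1^2 + \alpha_2^2 + \beta_2^2 + \gamma_2^2). \tag{10}$$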


11. The kinetic energy of the irregular part of the flux is represented
by the volume-integral

$$\tfrac{1}{2}\int(\dot\xi'\,\mathrm{Pot}\,\dot\xi' + \dot\eta'\,\mathrm{Pot}\,\dot\eta' + \dot\zeta'\,\mathrm{Pot}\,\dot\zeta')\,dv.$$

Now, since $\dot\xi'$, $\dot\eta'$, $\dot\zeta'$ are everywhere linear functions of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and
diff. coeff. (see § 4), and since the integrations implied in the notation
Pot may be confined to a sphere of which the radius is small in
comparison with a wave-length,* and since within such a sphere $\dot\xi$, $\dot\eta$, $\dot\zeta$
and diff. coeff. are sufficiently determined (in a linear form) by the
values of the same twelve quantities at the center of the sphere, it
follows that $\mathrm{Pot}\,\dot\xi'$, $\mathrm{Pot}\,\dot\eta'$, $\mathrm{Pot}\,\dot\zeta'$ must be linear functions of the values
of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff. at the point for which the potential is sought.
Hence,

$$\tfrac{1}{2}(\dot\xi'\,\mathrm{Pot}\,\dot\xi' + \dot\eta'\,\mathrm{Pot}\,\dot\eta' + \dot\zeta'\,\mathrm{Pot}\,\dot\zeta')$$

will be a quadratic function of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff. But the
seventy-eight coefficients by which this function is expressed will
vary with the position of the point considered with respect to the
surrounding molecules.


*See § 9 of the former paper, on page 187 of this volume.

Yet, as in the case of the statical energy, we may substitute the
average values of these coefficients for the coefficients themselves in
the integral by which we obtain the energy of any considerable space.
The kinetic energy due to the irregular part of the flux is thus
reduced to a quadratic function of $\dot\xi$, $\dot\eta$, $\dot\zeta$ and diff. coeff. which has
constant coefficients for a given medium and light of a given period.

The function may be divided into three parts, of which the first
contains the squares and products of $\dot\xi$, $\dot\eta$, $\dot\zeta$, the second the products
of $\dot\xi$, $\dot\eta$, $\dot\zeta$ with their differential coefficients, and the third, which may
be neglected, the squares and products of the differential coefficients.

We may proceed with the reduction precisely as in the case of the
statical energy, except that the differentiations with respect to the
time will introduce the constant factor $4\pi^2/p^2$. This will give for
the first part of the kinetic energy of the irregular flux per unit
of volume

$$\begin{aligned}
T'_{\prime} = {} & \frac{2\pi^2}{p^2}(A'\alpha_1^2 + B'\beta_1^2 + C'\gamma_1^2 + E'\beta_1\gamma_1 + F'\gamma_1\alpha_1 + G'\alpha_1\beta_1)\\
& + \frac{2\pi^2}{p^2}(A'\alpha_2^2 + B'\beta_2^2 + C'\gamma_2^2 + E'\beta_2\gamma_2 + F'\gamma_2\alpha_2 + G'\alpha_2\beta_2),
\end{aligned} \tag{11}$$

and for the second part of the same

$$T'_{\prime\prime} = \frac{4\pi^2\Phi'}{l\,p^2}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right], \tag{12}$$

where A’, B’, C’, E’, F’, G’ are constant, and $\Phi'$ a quadratic function of
L, M, and N, for a given medium and light of a given period.
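
12. Equating the statical and kinetic energies, we have

$$S_{\prime} + S_{\prime\prime} = T + T'_{\prime} + T'_{\prime\prime},$$

that is, by equations (6), (9), (10), (11), and (12),

$$\begin{aligned}
& \tfrac{1}{2}(A\alpha_1^2 + B\beta_1^2 + C\gamma_1^2 + E\beta_1\gamma_1 + F\gamma_1\alpha_1 + G\alpha_1\beta_1)\\
& + \tfrac{1}{2}(A\alpha_2^2 + B\beta_2^2 + C\gamma_2^2 + E\beta_2\gamma_2 + F\gamma_2\alpha_2 + G\alpha_2\beta_2)\\
& + \frac{\Phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]\\
={} & \frac{\pi l^2}{p^2}(\alpha_1^2 + \beta_1^2 + \gamma_1^2 + \alpha_2^2 + \beta_2^2 + \gamma_2^2)\\
& + \frac{2\pi^2}{p^2}(A'\alpha_1^2 + B'\beta_1^2 + C'\gamma_1^2 + E'\beta_1\gamma_1 + F'\gamma_1\alpha_1 + G'\alpha_1\beta_1)\\
& + \frac{2\pi^2}{p^2}(A'\alpha_2^2 + B'\beta_2^2 + C'\gamma_2^2 + E'\beta_2\gamma_2 + F'\gamma_2\alpha_2 + G'\alpha_2\beta_2)\\
& + \frac{4\pi^2\Phi'}{l\,p^2}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right].
\end{aligned} \tag{13}$$

If we set

$$a = \frac{A}{2\pi} - \frac{2\pi A'}{p^2}, \qquad \text{etc.,} \tag{14}$$

and

$$\phi = \frac{\Phi}{2\pi} - \frac{2\pi\Phi'}{p^2}, \tag{15}$$

the equation reduces to

$$\begin{aligned}
& a\alpha_1^2 + b\beta_1^2 + c\gamma_1^2 + e\beta_1\gamma_1 + f\gamma_1\alpha_1 + g\alpha_1\beta_1\\
& + a\alpha_2^2 + b\beta_2^2 + c\gamma_2^2 + e\beta_2\gamma_2 + f\gamma_2\alpha_2 + g\alpha_2\beta_2\\
& + \frac{2\phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]\\
={} & \frac{l^2}{p^2}(\alpha_1^2 + \beta_1^2 + \gamma_1^2 + \alpha_2^2 + \beta_2^2 + \gamma_2^2),
\end{aligned} \tag{16}$$

where a, b, c, e, f, g are constant, and $\phi$ a quadratic function of
L, M, N, for a given medium and light of a given period.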

13. Now this equation, which expresses a relation between the
constants of the equations of wave-motion (1), will apply, with those
equations, not only to such vibrations as actually take place, but also
to such as we may imagine to take place under the influence of
constraints determining the type of vibration. The free or unconstrained
vibrations, with which alone we are concerned, are characterized
by this, that infinitesimal variations (by constraint) of the
type of vibration, that is, of the ratios of the quantities $\alpha_1$, $\beta_1$, $\gamma_1$,
$\alpha_2$, $\beta_2$, $\gamma_2$, will not affect the period by any quantity of the same
order of magnitude.* These variations must however be consistent
with equations (4), which require that

$$L\,d\alpha_1 + M\,d\beta_1 + N\,d\gamma_1 = 0, \qquad L\,d\alpha_2 + M\,d\beta_2 + N\,d\gamma_2 = 0. \tag{17}$$

Hence, to obtain the conditions which characterize free vibration,
we may differentiate equation (16) with respect to $\alpha_1$, $\beta_1$, $\gamma_1$, $\alpha_2$, $\beta_2$, $\gamma_2$,
regarding all other letters as constant, and give to $d\alpha_1$, $d\beta_1$, $d\gamma_1$,
$d\alpha_2$, $d\beta_2$, $d\gamma_2$ such values as are consistent with equations (17).
Now $d\alpha_1$, $d\beta_1$, $d\gamma_1$ are independent of $d\alpha_2$, $d\beta_2$, $d\gamma_2$, and for either
three variations, values proportional either to $\alpha_1$, $\beta_1$, $\gamma_1$, or to $\alpha_2$, $\beta_2$, $\gamma_2$,
are possible. If, then, we differentiate equation (16) with respect to
$\alpha_1$, $\beta_1$, $\gamma_1$, and substitute first $\alpha_1$, $\beta_1$, $\gamma_1$, and then $\alpha_2$, $\beta_2$, $\gamma_2$, for
$d\alpha_1$, $d\beta_1$, $d\gamma_1$, and also differentiate with respect to $\alpha_2$, $\beta_2$, $\gamma_2$, with
similar substitutions, we shall obtain all the independent equations
which this principle will yield.

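If we differentiate with respect to $\alpha_1$, $\beta_1$, $\gamma_1$, and write $\alpha_1$, $\beta_1$, $\gamma_1$
for $d\alpha_1$, $d\beta_1$, $d\gamma_1$, we obtain

$$\begin{aligned}
& a\alpha_1^2 + b\beta_1^2 + c\gamma_1^2 + e\beta_1\gamma_1 + f\gamma_1\alpha_1 + g\alpha_1\beta_1\\
& + \frac{\phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]\\
={} & \frac{l^2}{p^2}(\alpha_1^2 + \beta_1^2 + \gamma_1^2).
\end{aligned} \tag{18}$$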


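* Compare § 11 of the former paper, page 189 of this volume.

If we differentiate with respect to $\alpha_1$, $\beta_1$, $\gamma_1$, and write $\alpha_2$, $\beta_2$, $\gamma_2$ for
$d\alpha_1$, $d\beta_1$, $d\gamma_1$, we obtain

$$\begin{aligned}
& 2a\alpha_1\alpha_2 + 2b\beta_1\beta_2 + 2c\gamma_1\gamma_2 + e(\beta_1\gamma_2 + \gamma_1\beta_2) + f(\gamma_1\alpha_2 + \alpha_1\gamma_2)\\
& + g(\alpha_1\beta_2 + \beta_1\alpha_2) = \frac{2l^2}{p^2}(\alpha_1\alpha_2 + \beta_1\beta_2 + \gamma_1\gamma_2).
\end{aligned} \tag{19}$$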


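If we differentiate with respect to $\alpha_2$, $\beta_2$, $\gamma_2$, and write $\alpha_2$, $\beta_2$, $\gamma_2$ for
$d\alpha_2$, $d\beta_2$, $d\gamma_2$, we obtain

$$\begin{aligned}
& a\alpha_2^2 + b\beta_2^2 + c\gamma_2^2 + e\beta_2\gamma_2 + f\gamma_2\alpha_2 + g\alpha_2\beta_2\\
& + \frac{\phi}{l}\left[L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2)\right]\\
={} & \frac{l^2}{p^2}(\alpha_2^2 + \beta_2^2 + \gamma_2^2).
\end{aligned} \tag{20}$$

The equation derived by differentiating with respect to $\alpha_2$, $\beta_2$, $\gamma_2$, and
writing $\alpha_1$, $\beta_1$, $\gamma_1$ for $d\alpha_2$, $d\beta_2$, $d\gamma_2$ is identical with (19). We should
also observe that equations (18) and (20) by addition give equation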
(16), which therefore will not need to be considered in addition to 
the last three equations. 

14. The geometrical signification of our equations may now be 
simplified by a suitable choice of the position of the origin of 
coordinates, which is as yet wholly arbitrary. 

We shall hereafter suppose that the origin is placed in a plane of 
maximum or minimum displacement,* if such there are. In the case
of circular polarization, in which the displacements are everywhere
equal, its position is immaterial. The lines $\rho_1$ and $\rho_2$, of which $\alpha_1$, $\beta_1$,
$\gamma_1$ and $\alpha_2$, $\beta_2$, $\gamma_2$ are respectively the components, will now be the
semi-axes of the displacement-ellipse, and therefore at right angles.
(See § 9.) The case of circular polarization will not constitute any 
exception. Hence, 

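$$\alpha_1\alpha_2 + \beta_1\beta_2 + \gamma_1\gamma_2 = 0, \tag{21}$$
and by § 9,

$$L(\beta_1\gamma_2 - \gamma_1\beta_2) + M(\gamma_1\alpha_2 - \alpha_1\gamma_2) + N(\alpha_1\beta_2 - \beta_1\alpha_2) = \pm\rho_1\rho_2, \tag{22}$$
where we are to read + or − in the last member according as the
system of displacements has the character of a right-handed or a
left-handed screw.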
15. Equation (19) is now reduced to the form 


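$$2a\alpha_1\alpha_2 + 2b\beta_1\beta_2 + 2c\gamma_1\gamma_2 + e(\beta_1\gamma_2 + \gamma_1\beta_2)
+ f(\gamma_1\alpha_2 + \alpha_1\gamma_2) + g(\alpha_1\beta_2 + \beta_1\alpha_2) = 0, \tag{23}$$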
* The reader will perceive that an earlier limitation of the position of the origin by a
supposition of this nature, involving a limitation of the values of $\alpha_1$, $\beta_1$, $\gamma_1$, $\alpha_2$, $\beta_2$, $\gamma_2$,
would have been embarrassing in the operations of the last paragraph.
which has a very simple geometrical signification. If we consider
the ellipsoid

$$ax^2 + by^2 + cz^2 + eyz + fzx + gxy = 1, \tag{24}$$

and especially its central section by a plane parallel to the planes of
the wave-system which we are considering, it will easily appear that
the equation

$$2ax_1x_2 + 2by_1y_2 + 2cz_1z_2 + e(y_1z_2 + z_1y_2)
+ f(z_1x_2 + x_1z_2) + g(x_1y_2 + y_1x_2) = 0$$


will hold of any two points $x_1, y_1, z_1$ and $x_2, y_2, z_2$ which belong to
conjugate diameters of this central section. Therefore equation (23)
expresses that the displacements $\alpha_1, \beta_1, \gamma_1$ and $\alpha_2, \beta_2, \gamma_2$ are parallel
to conjugate diameters of the central section of the ellipsoid (24) by a
wave-plane. But since the displacements $\alpha_1, \beta_1, \gamma_1$ and $\alpha_2, \beta_2, \gamma_2$ are
also at right angles to each other, it follows that they are parallel 
to the axes of the central section of the ellipsoid (24) by a wave-plane. 
That is:—The axes of the displacement-ellipse coincide in direction 
with those of a central section of the ellipsoid (24) by a wave-plane. 
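
The conclusion just stated admits of a simple numerical check. The following is only a sketch under stated assumptions: the coefficients of the ellipsoid and the direction of the wave-normal are chosen arbitrarily for illustration, and the function names are hypothetical. It finds the axes of the central section of the ellipsoid (24) by a wave-plane, so that one may confirm that they satisfy at once the condition of perpendicularity (21) and the condition of conjugate diameters (23).

```python
import math

def bilinear(coef, p, q):
    """Left member of equation (23) for the two vectors p and q."""
    a, b, c, e, f, g = coef
    (x1, y1, z1), (x2, y2, z2) = p, q
    return (2*a*x1*x2 + 2*b*y1*y2 + 2*c*z1*z2
            + e*(y1*z2 + z1*y2) + f*(z1*x2 + x1*z2) + g*(x1*y2 + y1*x2))

def dot(p, q):
    return sum(pi*qi for pi, qi in zip(p, q))

def cross(p, q):
    return (p[1]*q[2] - p[2]*q[1], p[2]*q[0] - p[0]*q[2], p[0]*q[1] - p[1]*q[0])

def unit(p):
    n = math.sqrt(dot(p, p))
    return tuple(pi/n for pi in p)

def section_axes(coef, normal):
    """Unit vectors along the axes of the central section perpendicular to `normal`."""
    n = unit(normal)
    # Any vector not parallel to n serves to start an orthonormal basis (u, v) of the plane.
    helper = (1.0, 0.0, 0.0) if abs(n[0]) < 0.9 else (0.0, 1.0, 0.0)
    u = unit(cross(n, helper))
    v = cross(n, u)
    # Quadratic form restricted to the plane (a common factor 2 is immaterial).
    quu, quv, qvv = bilinear(coef, u, u), bilinear(coef, u, v), bilinear(coef, v, v)
    # Rotation of the basis which removes the cross term.
    ang = 0.5*math.atan2(2*quv, quu - qvv)
    d1 = tuple(math.cos(ang)*ui + math.sin(ang)*vi for ui, vi in zip(u, v))
    d2 = tuple(-math.sin(ang)*ui + math.cos(ang)*vi for ui, vi in zip(u, v))
    return d1, d2

# Illustrative values only (assumed, not taken from the paper).
coef = (1.0, 1.5, 2.0, 0.3, 0.2, 0.1)      # a, b, c, e, f, g
d1, d2 = section_axes(coef, (1.0, 2.0, 2.0))
print(dot(d1, d2), bilinear(coef, d1, d2))  # both should vanish to rounding error
```

The two directions returned lie in the wave-plane, are at right angles, and make the left member of (23) vanish, which is the property attributed above to the axes of the displacement-ellipse.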

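16. If we write $U_1$, $U_2$ for the reciprocals of the semi-axes of the
central section of the ellipsoid (24) by a wave-plane, $U_1$ being the
reciprocal of the one to which the displacement $\alpha_1$, $\beta_1$, $\gamma_1$ is parallel,
we have

$$a\alpha_1^2 + b\beta_1^2 + c\gamma_1^2 + e\beta_1\gamma_1 + f\gamma_1\alpha_1 + g\alpha_1\beta_1 = U_1^2(\alpha_1^2 + \beta_1^2 + \gamma_1^2), \tag{25}$$

as is at once evident if we substitute the coordinates of an extremity
of the axis for the proportional quantities $\alpha_1$, $\beta_1$, $\gamma_1$. So also

$$a\alpha_2^2 + b\beta_2^2 + c\gamma_2^2 + e\beta_2\gamma_2 + f\gamma_2\alpha_2 + g\alpha_2\beta_2 = U_2^2(\alpha_2^2 + \beta_2^2 + \gamma_2^2). \tag{26}$$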
If we write V for the velocity of propagation of the system of 


progressive waves corresponding to the system of stationary waves 
which we have been considering, we shall have 


$$V = \frac{l}{p}. \tag{27}$$


By equations (22), (25), and (26), equations (18) and (20) are 
reduced to the form 


$$U_1^2\rho_1^2 \pm \frac{\phi}{V}\rho_1\rho_2 = V^2\rho_1^2, \qquad U_2^2\rho_2^2 \pm \frac{\phi}{V}\rho_1\rho_2 = V^2\rho_2^2, \tag{28}$$


where we are to read + or — according as the disturbance has the 
character of a right-handed or a left-handed screw. In a progressive
system of waves, when the combination of displacements has the 
character of a right-handed screw, the rotations will be such as appear 
clockwise to the observer, who looks in the direction opposite to that 
of the propagation of light. We shall call such a ray right-handed. 
We may here observe that in case $\phi = 0$ the solution of these
equations is very simple. We have necessarily either $\rho_2 = 0$ and
$V^2 = U_1^2$, or $\rho_1 = 0$ and $V^2 = U_2^2$. In this case, the light is linearly
polarized, and the directions of oscillation and the velocities of
propagation are given by Fresnel’s law. Experiment has shown that
this is the usual case. We wish, however, to investigate the case in
which $\phi$ does not vanish. Since the term containing $\phi$ arises from
the consideration of those quantities which it was allowable to neglect
in the first approximation, we may assume that $\phi$ is always very
small in comparison with $V^3$, $U_1^3$, or $U_2^3$.
17. Equations (28) may be written 


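$$V^2 - U_1^2 = \pm\frac{\phi}{V}\,\frac{\rho_2}{\rho_1}, \qquad V^2 - U_2^2 = \pm\frac{\phi}{V}\,\frac{\rho_1}{\rho_2}. \tag{29}$$

By multiplication we obtain

$$V^2(V^2 - U_1^2)(V^2 - U_2^2) = \phi^2. \tag{30}$$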


Since $\phi$ is a very small quantity, it is evident from inspection of this
equation that it will admit three values of $V^2$, of which one will be a
very little greater than the greater of the two quantities $U_1^2$ and $U_2^2$,
another will be a very little less than the less of the same two quantities,
and the third will be a very small quantity. It is evident that
the values of $V^2$ with which we have to do are those which differ but
little from $U_1^2$ and $U_2^2$.*

For the numerical computation of $V$, when $U_1$, $U_2$, and $\phi$ are
known numerically, we may divide the equation by $V^2$, and then
solve it as if the second member were known. This will give

$$V^2 = \frac{U_1^2 + U_2^2}{2} \pm \sqrt{\frac{(U_1^2 - U_2^2)^2}{4} + \frac{\phi^2}{V^2}}. \tag{31}$$
By substituting $U_1U_2$ for $V^2$ in the second member, we may obtain
a close approximation to the two values of $V^2$. Each of the values
obtained may be improved by substitution of that value for $V^2$ in the
second member of the equation. 
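
The process of successive substitution described above may be sketched as follows. This is an illustration only, not a transcription of any computation in the paper: it supposes the cubic to read $V^2(V^2 - U_1^2)(V^2 - U_2^2) = \phi^2$ and its solved form to be $V^2 = \tfrac{1}{2}(U_1^2 + U_2^2) \pm \sqrt{\tfrac{1}{4}(U_1^2 - U_2^2)^2 + \phi^2/V^2}$; the numerical values of $U_1$, $U_2$, $\phi$ and the function names are assumptions made for the example.

```python
import math

def wave_velocities_squared(U1, U2, phi, iterations=20):
    """Return the two relevant values of V^2 (greater, lesser) by repeated substitution."""
    mean = 0.5*(U1**2 + U2**2)
    half_diff_sq = (0.5*(U1**2 - U2**2))**2
    roots = []
    for sign in (+1.0, -1.0):
        v2 = U1*U2                      # starting value suggested in the text
        for _ in range(iterations):
            # Treat the second member as known and solve the quadratic in V^2.
            v2 = mean + sign*math.sqrt(half_diff_sq + phi**2/v2)
        roots.append(v2)
    return tuple(roots)

def cubic_residual(v2, U1, U2, phi):
    """Difference of the two members of the cubic in V^2; zero at a root."""
    return v2*(v2 - U1**2)*(v2 - U2**2) - phi**2

# Illustrative values only (assumed): phi small compared with the cubes of U1, U2.
U1, U2, phi = 1.0, 0.9, 1e-3
greater, lesser = wave_velocities_squared(U1, U2, phi)
print(greater - U1**2, U2**2 - lesser)  # both small and positive
```

Because $\phi$ is small, each substitution alters the result very little, and a few repetitions suffice. The third, very small, root of the cubic is not reached by this process, in agreement with the remark that it has no physical significance.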

For either value of $V^2$, we may easily find the ratio of $\rho_1$ to $\rho_2$,
that is, the ratio of the axes of the displacement-ellipse, from one of
equations (29), or from the equation

$$\frac{\rho_1^2}{\rho_2^2} = \frac{V^2 - U_2^2}{V^2 - U_1^2}, \tag{32}$$
obtained by combining the two.


* We should not attribute any physical significance to the third value of $V^2$. For
this value would imply a wave-length very small in comparison with the length of 
ordinary waves of light, and with respect to which our fundamental assumption that 
the wave-length is very great in comparison with the distances of contiguous molecules 
would be entirely false. Our analysis, therefore, furnishes no reason for supposing that 
any such velocities are possible for the propagation of electrical disturbances. 
In equations (29), we are to read + or − in the second members,
according as the ray is right-handed or left-handed. (See § 16.) It
follows that if the value of $\phi$ is positive, the greater velocity will
belong to a right-handed ray, and the smaller to a left-handed, but if
the value of $\phi$ is negative, the opposite is the case. Except when
$\phi = 0$, and the polarization is linear, there will be one right-handed
and one left-handed ray for any given wave-normal and period.

18. When $U_1 = U_2$, equations (29) give

$$\rho_1 = \rho_2, \qquad V^2 = U^2 \pm \frac{\phi}{V},$$

where $U$ represents the common value of $U_1$ and $U_2$. The polarization
is therefore circular. The converse is also evident from equations
(29), viz. that a ray can be circularly polarized only when the
direction of its wave-normal is such that $U_1 = U_2$. Such a direction,
which is determined by a circular section of the ellipsoid (24) precisely
as an optic axis of a crystal which conforms to Fresnel’s law of double
refraction, may be called an optic axis, although its physical properties
are not the same as in the more ordinary case.* If we write
$V_R$ and $V_L$, respectively, for the wave-velocities of the right-handed
and left-handed rays, we have

$$V_R^2 = U^2 + \frac{\phi}{V_R}, \qquad V_L^2 = U^2 - \frac{\phi}{V_L}, \tag{33}$$
whence
$$V_R^2 - V_L^2 = \phi\left(\frac{1}{V_R} + \frac{1}{V_L}\right) = \phi\,\frac{V_R + V_L}{V_RV_L},$$
and
$$V_R - V_L = \frac{\phi}{V_RV_L}. \tag{34}$$


The phenomenon best observed with respect to an optic axis is the 
rotation of the plane of linearly polarized light. If we denote by $\theta$
the amount of this rotation per unit of the distance traversed by the 
wave-plane, regarding it as positive when it appears clockwise to the 


* Our experimental knowledge of circularly or elliptically polarizing media is confined 
to such as are optically either isotropic or uniaxial. The general theory of such media, 
embracing the case of two optic axes, has however been discussed by Professor von Lang 
(“Theorie der Circularpolarization,” Sitz.-Ber. Wiener Akad., vol. lxxv, p. 719). The
general results of the present paper, although derived from physical hypotheses of an
entirely different nature, are quite similar to those of the memoir cited. They would
become identical, the writer believes, by the substitution of a constant for $\frac{\phi p}{l}$ or $\frac{\phi}{V}$
in the equations of this paper. (See especially equations (18), (20), (28).)

That a complete discussion of the subject on any theory must include the case of 
biaxial media having the property of circular or elliptical polarization, is evident from 
the consideration that it must at least be possible to produce examples of such media 
artificially. An isotropic or uniaxial crystal may be made biaxial by pressure. If it 
has the property of circular and elliptic polarization, that property cannot be wholly 
destroyed by the application of small pressures. 


observer, who looks in the direction opposite to that of the propagation
of the light,* we have

$$\theta = \frac{\pi}{p}\left(\frac{1}{V_L} - \frac{1}{V_R}\right). \tag{35}$$
By the preceding equation, this reduces to
$$\theta = \frac{\pi\phi}{p\,V_R^2V_L^2}. \tag{36}$$

Without any appreciable error, we may substitute $U^4$ for $V_R^2V_L^2$,
which will give†
$$\theta = \frac{\pi\phi}{p\,U^4}. \tag{37}$$
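
The three expressions for the rotation may be compared numerically. The sketch below is offered only as an illustration: it supposes equations (33) to read $V_R^2 = U^2 + \phi/V_R$ and $V_L^2 = U^2 - \phi/V_L$, and the rotation to be $\theta = (\pi/p)(1/V_L - 1/V_R)$; the values of $U$, $\phi$, $p$ and the function names are assumed for the example.

```python
import math

def circular_ray_velocities(U, phi, iterations=50):
    """Solve V_R^3 - U^2 V_R = phi and U^2 V_L - V_L^3 = phi by Newton's method."""
    def newton(sign):
        v = U                                   # both velocities differ little from U
        for _ in range(iterations):
            v -= (v**3 - U**2*v - sign*phi) / (3*v**2 - U**2)
        return v
    return newton(+1.0), newton(-1.0)           # (V_R, V_L)

def rotation_per_unit_length(U, phi, p):
    """Rotation computed in the three forms corresponding to (35), (36), (37)."""
    v_r, v_l = circular_ray_velocities(U, phi)
    theta_35 = math.pi/p*(1/v_l - 1/v_r)
    theta_36 = math.pi*phi/(p*v_r**2*v_l**2)
    theta_37 = math.pi*phi/(p*U**4)
    return theta_35, theta_36, theta_37

# Illustrative values only (assumed).
print(rotation_per_unit_length(1.0, 1e-4, 2.0))
```

The first two values agree to the accuracy of the arithmetic, since (36) follows from (35) and (34) without approximation. The third differs from them only in the ratio of $U^4$ to $V_R^2V_L^2$, which is the substitution made in passing to (37).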
19. Since these equations involve unknown functions of the period
they will not serve for an exact determination of the relation between
$\theta$ and the period. For a rough approximation, however, we may
assume that the manner in which the general displacement in any
small part of the medium distributes itself among the molecules and
intermolecular spaces is independent of the period, being determined
entirely by the values of $\xi$, $\eta$, $\zeta$ and their differential coefficients with
respect to the coordinates.‡ For a fixed direction of the wave-normal,
$\Phi$ and $\Phi'$ will then be constant. Now equations (15) and (36) give
$$\theta = \frac{\Phi}{2p^2V_R^2V_L^2} - \frac{2\pi^2\Phi'}{p^4V_R^2V_L^2}.$$
To express this result in terms of the quantities directly observed, we
may use the equations
$$p = \frac{\lambda}{k}, \qquad V_R = \frac{k}{n_R}, \qquad V_L = \frac{k}{n_L}, \qquad U = \frac{k}{n}, \tag{38}$$
where $k$ denotes the velocity of light in vacuo, $\lambda$ the wave-length
in vacuo of the light employed, $n_R$, $n_L$ the absolute indices of refraction
of the two rays, and $n$ the index for the optic axis as derived
from the ellipsoid (24) by Fresnel’s law. We thus obtain

$$\theta = \frac{\Phi\,n_R^2n_L^2}{2k^2\lambda^2} - \frac{2\pi^2\Phi'\,n_R^2n_L^2}{\lambda^4}. \tag{39}$$
* When the rotation of the plane of polarization appears clockwise to the observer,
it has the character of a left-handed screw. But the circularly polarized ray to which
$V_R$ relates, the rotation of which also appears clockwise to the observer, has the
character of a right-handed screw.
† The degree of accuracy of this substitution may be shown as follows. By (33)
$$V_R(V_R^2 - U^2) = V_L(U^2 - V_L^2),$$
whence
$$V_R^3 + V_L^3 = (V_R + V_L)U^2,$$
$$V_R^2 - V_RV_L + V_L^2 = U^2,$$
$$V_RV_L = U^2 - (V_R - V_L)^2.$$
‡ Compare § 12 of the former paper, on page 189 of this volume.
In the case of uniaxial crystals, the direction of the optic axis is
fixed. We may therefore write

$$\theta = n_R^2n_L^2\left(\frac{K}{\lambda^2} + \frac{K'}{\lambda^4}\right), \tag{40}$$

regarding $K$ and $K'$ as constants. If we had used equation (37), we
should have had the factor $n^4$ instead of $n_R^2n_L^2$. Since this factor
varies but slowly with $\lambda$, it may be neglected, if its omission is
compensated in the values of K and K’. The formula being only 
approximative, such a simplification will not necessarily render it 
less accurate. 
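
To show how a formula of this kind might be applied, the following sketch determines $K$ and $K'$ from two observations of the rotation, taking (40) in the form $\theta = n_R^2n_L^2(K/\lambda^2 + K'/\lambda^4)$. It is an illustration under stated assumptions only: the observations used are synthetic, generated from assumed constants, and are not measurements; the function names are hypothetical.

```python
def rotation_40(lam, n_r, n_l, K, K_prime):
    """Rotation per unit length according to the two-constant formula."""
    return n_r**2*n_l**2*(K/lam**2 + K_prime/lam**4)

def fit_constants(obs1, obs2):
    """Solve for K and K' from two observations (lam, n_r, n_l, theta)."""
    rows = []
    for lam, n_r, n_l, theta in (obs1, obs2):
        # Each observation gives one linear equation in K and K'.
        rows.append((1/lam**2, 1/lam**4, theta/(n_r**2*n_l**2)))
    (a11, a12, b1), (a21, a22, b2) = rows
    det = a11*a22 - a12*a21
    if det == 0:
        raise ValueError("the two wave-lengths must be different")
    return (b1*a22 - b2*a12)/det, (a11*b2 - a21*b1)/det

# Synthetic observations (assumed constants K = 2.0, K' = 0.5, arbitrary units).
obs = []
for lam, n_r, n_l in ((0.5, 1.5501, 1.5500), (0.7, 1.5401, 1.5400)):
    obs.append((lam, n_r, n_l, rotation_40(lam, n_r, n_l, 2.0, 0.5)))
print(fit_constants(*obs))
```

With more than two wave-lengths the same linear equations might be treated by least squares. Nothing in the sketch depends on the units chosen for $\lambda$ and $\theta$.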

20. But without any such assumption as that contained in the
last paragraph, we may easily obtain formulæ for the experimental
determination of $\Phi$ and $\Phi'$ for the optic axis of a uniaxial crystal.
Considerations analogous to those of § 13 of the former paper
(page 190 of this volume), show that in differentiating equation (39)
we may regard $\Phi$ and $\Phi'$ as constant, although they may actually
vary with $\lambda$. This equation may be written

$$\frac{\theta\lambda^2}{n_R^2n_L^2} = \frac{\Phi}{2k^2} - \frac{2\pi^2\Phi'}{\lambda^2}. \tag{41}$$

Therefore,

$$\frac{d\left(\dfrac{\theta\lambda^2}{n_R^2n_L^2}\right)}{d(\lambda^{-2})} = -2\pi^2\Phi'. \tag{42}$$

When $\Phi'$ has been determined by this equation, $\Phi$ may be found
from the preceding.

21. If we wish to represent $\phi$ geometrically, like $U_1$ and $U_2$, we
may construct the surfaces

$$Ax^2 + By^2 + Cz^2 + Eyz + Fzx + Gxy = \pm 1, \tag{43}$$


the coefficients $A$, $B$, etc., being the same by which $\phi$ is expressed in
terms of $L^2$, $M^2$, etc. The numerical value of $\phi$, for any direction
of the wave-normal, will thus be represented by the square of the
reciprocal of the radius vector of the surface drawn in the same
direction. The positive or negative character of $\phi$ must be separately
indicated. There are here two cases to be distinguished. If the
sign of $\phi$ is the same in all directions, the surface will be an
ellipsoid, and we have only to know whether all the values of $\phi$ are
to be taken positively or all negatively. But if $\phi$ is positive for some
directions and negative for others, the surface will consist of two
conjugate hyperboloids, to one of which the positive, and to the other
the negative values belong.
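
The construction may be illustrated by a short computation. The sketch assumes, as the text implies, that $\phi = AL^2 + BM^2 + CN^2 + EMN + FNL + GLM$ for a wave-normal with direction-cosines $L$, $M$, $N$; the particular coefficients and the function names are assumptions made for the example.

```python
import math

def quadratic(coef, point):
    """Value of A x^2 + B y^2 + C z^2 + E y z + F z x + G x y at the given point."""
    A, B, C, E, F, G = coef
    x, y, z = point
    return A*x*x + B*y*y + C*z*z + E*y*z + F*z*x + G*x*y

def phi_of_direction(coef, direction):
    """phi for the wave-normal `direction` (which need not be of unit length)."""
    norm = math.sqrt(sum(d*d for d in direction))
    return quadratic(coef, tuple(d/norm for d in direction))

def surface_point(coef, direction):
    """Point of the representing surface on `direction`, with the sign of phi."""
    phi = phi_of_direction(coef, direction)
    if phi == 0:
        raise ValueError("phi vanishes: the radius vector is infinite in this direction")
    r = 1/math.sqrt(abs(phi))                  # phi is the squared reciprocal of r
    norm = math.sqrt(sum(d*d for d in direction))
    return tuple(r*d/norm for d in direction), (1 if phi > 0 else -1)

# Illustrative coefficients only (assumed): phi = L^2 - M^2, positive for some
# directions and negative for others, so that two conjugate surfaces are needed.
coef = (1.0, -1.0, 0.0, 0.0, 0.0, 0.0)
print(surface_point(coef, (2.0, 1.0, 0.0)))
print(surface_point(coef, (1.0, 2.0, 0.0)))
```

In the first direction the point lies on the surface with $+1$ for its second member, in the second on that with $-1$; along the axis of $Z$ the value of $\phi$ is zero and no finite radius vector exists.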
22. The manner in which the ellipsoid (24) may be partially
determined by the relations of symmetry which the medium may
possess, has been sufficiently discussed in the former paper.

With respect to the quantity $\phi$, and the surfaces which determine
it, the following principle is of fundamental importance. If one body
is identical in its internal structure with the image by reflection of
another, the values of $\phi$ in corresponding lines in the two bodies will
be numerically equal but have opposite signs.*

It follows that if a body is identical in internal structure with its
own image by reflection, the value of $\phi$ (if not zero for all directions)
must be positive for some directions and negative for others. Moreover,
the above described surface by which $\phi$ is represented must
consist of two conjugate hyperboloids, of which one is identical in
form with the image by reflection of the other. This requires that
the hyperboloids shall be right cylinders with conjugate rectangular
hyperbolas for bases. A crystal characterized by such properties will
belong to the tetragonal system. Since $\phi = 0$ for the optic axis, it
would be difficult to distinguish a case of this kind from an ordinary
uniaxial crystal, unless the ellipsoid (24) should approach very closely
to a sphere.†

It is only in the very limited case described in the last paragraph 
that a medium which is identical in its internal structure with its 
image by reflection can have the property of circular or elliptic 
polarization. To media which are unlike their images by reflection, 
and have the property of circular polarization, we may apply the 
following general principles. 

If the medium has any axis of symmetry, the ellipsoid or hyperboloids
which represent the values of $\phi$ will have an axis in the same
direction. If the medium after a revolution of less than 180° about 
any axis is equivalent to the medium in its first position, the ellipsoid 
or hyperboloids will have an axis of revolution in that direction. 

23. The laws of the propagation of light in plane waves, which 


*The necessity of the opposite signs will perhaps appear most readily from the 
consideration that the direction of rotation of the plane of polarization must be opposite 
in the two bodies. 

† There is no difficulty in conceiving of the constitution of a body which would have
the properties described above. Thus, we may imagine a body with molecules of a 
spiral form, of which one-half are right-handed and one-half left-handed, and we may 
suppose that the motion of electricity is opposed by a less resistance within them than 
without. If the axes of the right-handed molecules are parallel to the axis of X, and 
those of the left-handed molecules to the axis of Y, their effects would counterbalance 
one another when the wave-normal is parallel to the axis of Z. But when the wave- 
normal (of a beam of linearly polarized light) is parallel to the axis of X, the left-handed 
molecules would produce a left-handed (negative) rotation of the plane of polarization, 
the right-handed molecules having no effect ; and when the wave-normal is parallel to 
the axis of Y, the reverse would be the case. 

have thus been derived from the single hypothesis that the disturbance
by which light is transmitted consists of solenoidal electrical
fluxes, and which apply to light of different colors and to the most 
general case of perfectly transparent and sensibly homogeneous media 
not subject to magnetic action,* are essentially those which are 
generally received as embodying the results of experiment. In no 
particular, so far as the writer is aware, do they conflict with the 
results of experiment, or require the aid of auxiliary and forced 
hypotheses to bring them into harmony therewith. 

In this respect, the electromagnetic theory of light stands in marked 
contrast with that theory in which the properties of an elastic solid 
are attributed to the ether,—a contrast which was very distinct in 
Maxwell's derivation of Fresnel’s laws from electrical principles, but 
becomes more striking as we follow the subject farther into its details, 
and take account of the want of absolute homogeneity in the medium, 
so as to embrace the phenomena of the dispersion of colors and circular 
and elliptical polarization. 


* The rotation of the plane of polarization which is produced by magnetic action has 
been discussed by Maxwell (Treatise on Electricity and Magnetism, vol. ii, chap. xxi), 
and by Rowland (Amer. Journ. Math., vol. iii, p. 107). 


XIII. 


ON THE GENERAL EQUATIONS OF MONOCHROMATIC LIGHT 
IN MEDIA OF EVERY DEGREE OF TRANSPARENCY. 


[American Journal of Science, ser. 3, vol. xxv, pp. 107-118, February, 1883.]


1. The last April and June numbers of this Journal* contain an
investigation of the velocity of plane waves of light, in which they 
are regarded as consisting of solenoidal electrical fluxes in an indefi- 
nitely extended medium of uniform and very fine-grained structure. 
It was also supposed that the medium was perfectly transparent, 
although without discussion of the physical properties on which 
transparency depends, and that the electrical motions were not 
complicated by any distinctively magnetic phenomena. 

In the present paper† the subject will be treated with more
generality, so as to obtain the general equations of monochromatic
light for media of every degree of transparency, whether sensibly
homogeneous or otherwise, which have a very fine-grained molecular
structure as measured by a wave-length of light. There will be no
restriction with respect to magnetic influence, except that an oscillating
magnetization of the medium will be excluded.‡

In order to conform as much as possible to the ordinary view of 


*See pages 182-194 and 195-210 of this volume. 

† This paper contains, with some additional developments, the substance of a communication
to the National Academy of Sciences in November, 1882.

‡ Where a body capable of magnetization is subjected to the influence of light (as
when light is reflected from the surface of iron), there are two simple hypotheses which 
present themselves with respect to the magnetic state of the body. One is that the 
magnetic forces due to the light are not of sufficient duration to allow the molecular 
changes which constitute magnetization to take place to any sensible extent. The other 
is that the magnetization has a constant ratio to the magnetic force without regard to 
its duration. We might easily make a more general hypothesis which would embrace 
both of those mentioned as extreme cases, and which would be irreproachable from a 
theoretical stand-point ; but it would complicate our equations to a degree which would 
not be compensated by their greater generality, since no phenomena depending on such 
magnetization have been observed, so far as the writer is aware, or are likely to be, 
except in a very limited class of cases.

For the purposes of this paper, therefore, it has seemed better to exclude media 
capable of magnetization, except so far as the first mentioned hypothesis may be appli- 
cable. But it does not appear that this requires us to exclude cases in which the medium 
is subject to the influence of a permanent magnetic force, such as produces the 
phenomenon of the magnetic rotation of the plane of polarization. 
electrical phenomena,* we shall not introduce at first the hypothesis
of Maxwell that electrical fluxes are solenoidal.† Our results, however,
will be such as to require us to admit the substantial truth of this 
hypothesis, if we regard the processes involved in the transmission of 
light as electrical. 

With regard to the undetermined questions of electrodynamic in- 
duction, we shall adopt provisionally that hypothesis which appears the 
most simple, yet proceed in such a manner that it will be evident exactly 
how our results must be altered, if we prefer any other hypothesis. 

Electrical quantities will be treated as measured in electromagnetic 
units. 

2. We must distinguish, as before, between the actual electrical 
displacements, which are too complicated to follow in detail with 
analysis, and which in their minutiæ elude experimental demonstration,
and the displacements as averaged for spaces which are large enough 
to smooth out their minor irregularities, but not so large as to oblite- 
rate to any sensible extent those more regular features of the electrical 
motion, which form the subject of optical experiment. These spaces 
must therefore be large as measured by the least distances between 
molecules, but small as measured by a wave-length of light. We 
shall also have occasion to consider similar averages for other quantities,
as electromotive force, the electrostatic potential, etc. It will
be convenient to suppose that the space for which the average is
taken is the same in all parts of the field,‡ say a sphere of uniform
radius having its center at the point considered. 

Whatever may be the quantities considered, such averages will be 
represented by the notation 


$$[\quad]_{\mathrm{Ave}}.$$


*It has, perhaps, retarded the acceptance of the electromagnetic theory of light that 
it was presented in connection with a theory of electrical action, which is probably more 
difficult to prove or disprove, and certainly presents more difficulties of comprehension, 
than the connection of optical and electrical phenomena, and which, as resting largely 
on a priori considerations, must naturally appear very differently to different minds.
Moreover, the mathematical method by which the subject was treated, while it will 
remain a striking monument of its author’s originality of thought, and profoundly 
modify the development of mathematical physics, must nevertheless, by its wide depar- 
ture from ordinary methods, have tended to repel such as might not make it a matter 
of serious study. 

† A flux is said to be solenoidal when it satisfies the conditions which characterize
the motion of an incompressible fluid; in other words, if $u$, $v$, $w$ are the rectangular
components of the flux, when

$$\frac{du}{dx} + \frac{dv}{dy} + \frac{dw}{dz} = 0,$$

and the normal component of the flux is the same on both sides of any surfaces of
discontinuity which may exist.
‡ This is rather to fix our ideas, than on account of any mathematical necessity. For
the space for which the average is taken may in general be considerably varied without
sensibly affecting the value of the average.
If, then, $\xi$, $\eta$, $\zeta$ denote the components of the actual displacement at
the point considered,

$$[\xi]_{\mathrm{Ave}}, \quad [\eta]_{\mathrm{Ave}}, \quad [\zeta]_{\mathrm{Ave}}$$

will represent the average values of these components in the small
sphere about that point. These average values we shall treat as
functions of the coordinates of the center of the sphere and of the
time, and may call them, for brevity, the average values of $\xi$, $\eta$, $\zeta$.
But however they may be designated, it is essential to remember that
it is a space-average for a certain very small space, and never a time-average,
that is intended.

The object of this paper will be accomplished when we have
expressed (explicitly or implicitly) the relations which subsist
between the values of $[\xi]_{\mathrm{Ave}}$, $[\eta]_{\mathrm{Ave}}$, $[\zeta]_{\mathrm{Ave}}$, at different times and in
different parts of the field—in other words, when we have found 
the conditions which these quantities must satisfy as functions of 
the time and the coordinates. 

3. Let us suppose that luminous vibrations of any one period * are 
somewhere excited, and that the disturbance is propagated through 
the medium. The motions which are excited in any part of the 
medium, and the forces by which they are kept up, will be expressed 
by harmonic functions of the time, having the same period,† as may
be proved by the single principle of the superposition of motions 
quite independently of any theory of the constitution of the medium, 
or of the nature of the motions, as electrical or otherwise. This is 
equally true of the actual motions, and of the averages which we are 
to consider. We may therefore set 


$$[\xi]_{\mathrm{Ave}} = \alpha_1\cos\frac{2\pi t}{p} + \alpha_2\sin\frac{2\pi t}{p}, \tag{1}$$

etc.,


* There is no real loss of generality in making the light monochromatic, since in every 
case it may be divided into parts, which are separately propagated, and each of which 
is monochromatic to any required degree of approximation. 

† It is of course possible that the expressions for the forces and displacements should
have constant terms. But these will disappear, if the displacements are measured from 
the state of equilibrium about which the system vibrates, and we leave out of account 
in measuring the forces (and the electrostatic potential) that which would belong to the 
system in the state of equilibrium. To prevent misapprehension, it should be added 
that the term electrical displacement is not used in the restricted sense of dielectric 
displacement or polarization. The variation of the electrical displacement, as the term 
is used in this paper, constitutes what Maxwell calls the total motion of electricity or 
true current, and what he divides into two parts, which he distinguishes as the current 
of conduction and the variation of the electrical displacement. Such a division of 
the total motion of electricity is not necessary for the purposes of this paper, and 
the term displacement is used with reference to the total motion of electricity in a 
manner entirely analogous to that in which the term is ordinarily used in the theory of 


wave-motion. 
where $t$ denotes the time, $p$ the period, and $\alpha_1$, $\alpha_2$ functions of the
coordinates. It follows that

$$[\ddot\xi]_{\mathrm{Ave}} = -\frac{4\pi^2}{p^2}[\xi]_{\mathrm{Ave}}, \tag{2}$$
etc.
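
The passage from (1) to (2) is a double differentiation with respect to the time, and may be verified numerically. The sketch below is an illustration only: it takes (1) as $[\xi]_{\mathrm{Ave}} = \alpha_1\cos(2\pi t/p) + \alpha_2\sin(2\pi t/p)$, and the amplitudes, period and function names are assumed for the example.

```python
import math

def displacement(t, alpha1, alpha2, p):
    """Average displacement component at time t, as a harmonic function of period p."""
    w = 2*math.pi/p
    return alpha1*math.cos(w*t) + alpha2*math.sin(w*t)

def second_derivative(f, t, h=1e-4):
    """Central finite-difference estimate of the second derivative of f at t."""
    return (f(t + h) - 2*f(t) + f(t - h))/h**2

# Illustrative values only (assumed).
alpha1, alpha2, p, t = 0.3, -0.7, 2.0, 0.37
f = lambda s: displacement(s, alpha1, alpha2, p)
print(second_derivative(f, t), -(4*math.pi**2/p**2)*f(t))  # nearly equal
```

The agreement holds for every instant and for any values of $\alpha_1$ and $\alpha_2$, which is why the acceleration may everywhere be replaced by the displacement multiplied by $-4\pi^2/p^2$.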


4. Now, on the electrical theory, these motions are excited by
electrical forces, which are of two kinds, distinguished as electrostatic
and electrodynamic. The electrostatic force is determined by
the electrostatic potential. If we write $q$ for the actual value of the
potential, and $[q]_{\mathrm{Ave}}$ for its value as averaged in the manner specified
above, the components of the actual electrostatic force will be

$$-\frac{dq}{dx}, \quad -\frac{dq}{dy}, \quad -\frac{dq}{dz},$$
and for the average values of these components in the small spaces
described above we may write

$$-\frac{d[q]_{\mathrm{Ave}}}{dx}, \quad -\frac{d[q]_{\mathrm{Ave}}}{dy}, \quad -\frac{d[q]_{\mathrm{Ave}}}{dz},$$
for it will make no difference whether we take the average before 
or after differentiation. 

5. The electrodynamic force is determined by the acceleration of 
electrical flux in all parts of the field, but physicists are not entirely 
agreed in regard to the laws by which it is determined. This
difference of opinion is however of less importance, since it will not
affect the result if electrical fluxes are always solenoidal. According
to the most simple law, the components of the force are given by
the volume-integrals

$$-\iiint\frac{\ddot\xi}{r}\,dv, \quad -\iiint\frac{\ddot\eta}{r}\,dv, \quad -\iiint\frac{\ddot\zeta}{r}\,dv,$$


where dv represents an element of volume, and r the distance of this 
element from the point for which the value of the electromotive 
force is to be determined. In other words, the components of the 
force at any point are determined from the components of acceleration 
in all parts of the field by the same process by which (in the theories 
of gravitation, etc.) the value of the potential at any point is deter- 
mined from the density of matter in all parts of space, except that 
the sign is to be reversed. Adopting this law, provisionally at least, 
we may express it by saying that the components of electrodynamic 
force are equal to the potentials taken negatively of the components 
of acceleration of electrical flux. And we may write, for brevity, 


$$-\operatorname{Pot}\ddot\xi, \quad -\operatorname{Pot}\ddot\eta, \quad -\operatorname{Pot}\ddot\zeta,$$

for the components of force, using the symbol Pot to denote the
operation by which the potential of a mass is derived from its
density. For the average values of these components in the small
spaces defined above, we may write

$$-\operatorname{Pot}[\ddot\xi]_{\mathrm{Ave}}, \quad -\operatorname{Pot}[\ddot\eta]_{\mathrm{Ave}}, \quad -\operatorname{Pot}[\ddot\zeta]_{\mathrm{Ave}},$$


since it will make no difference whether we take the average before 
or after the operation of taking the potential. 

6. If we write X, Y, Z for the components of the total electromotive 
force (electrostatic and electrodynamic), we have 


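$$[X]_{\mathrm{Ave}} = -\operatorname{Pot}[\ddot\xi]_{\mathrm{Ave}} - \frac{d[q]_{\mathrm{Ave}}}{dx}, \tag{3}$$
etc.,
or by (2)

$$[X]_{\mathrm{Ave}} = \frac{4\pi^2}{p^2}\operatorname{Pot}[\xi]_{\mathrm{Ave}} - \frac{d[q]_{\mathrm{Ave}}}{dx}, \tag{4}$$
etc.

It will be convenient to represent these relations by a vector
notation. If we represent the displacement by $U$, and the electromotive
force by $E$, the three equations of (3) will be represented by
the single vector equation

$$[E]_{\mathrm{Ave}} = -\operatorname{Pot}[\ddot U]_{\mathrm{Ave}} - \nabla[q]_{\mathrm{Ave}}, \tag{5}$$
and the three equations of (4) by the single vector equation
$$[E]_{\mathrm{Ave}} = \frac{4\pi^2}{p^2}\operatorname{Pot}[U]_{\mathrm{Ave}} - \nabla[q]_{\mathrm{Ave}}, \tag{6}$$

where, in accordance with quaternionic usage, $\nabla[q]_{\mathrm{Ave}}$ represents the
vector which has for components the derivatives of $[q]_{\mathrm{Ave}}$ with respect
to rectangular coordinates. The symbol Pot in such a vector equation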
signifies that the operation which is denoted by this symbol in a 
scalar equation is to be performed upon each of the components of 
the vector. 

7. We may here observe that if we are not satisfied with the law 
adopted for the determination of electrodynamic force we have only 
to substitute for — Pot in these vector equations, and in those which 
follow, the symbol for the operation, whatever it may be, by which 
we calculate the electrodynamic force from the acceleration.* For 
the operation must be of such a character that if the acceleration 
consist of any number of parts, the force due to the whole acceleration 
will be the resultant of the forces due to the separate parts. It will 
evidently make no difference whether we take an average before or 
after such an operation. 
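
The property required of the operation, that the force due to the whole acceleration shall be the resultant of the forces due to its parts, may be illustrated on a rude discrete stand-in for Pot. What follows is only a sketch under stated assumptions: the continuous volume-integral is replaced by a sum over a few cells of equal volume, the cell at the field point itself is omitted, and the grid, densities and function name are hypothetical.

```python
import math

def pot(density, cells, point, dv):
    """Discrete stand-in for Pot: sum of density*dv/r over the cells, omitting r = 0."""
    total = 0.0
    for value, cell in zip(density, cells):
        r = math.dist(cell, point)
        if r > 0.0:
            total += value*dv/r
    return total

# A small cubical grid of cell centres and two arbitrary "densities" (assumed).
cells = [(x, y, z) for x in range(3) for y in range(3) for z in range(3)]
part_a = [math.sin(1.0 + i) for i in range(len(cells))]
part_b = [math.cos(0.5*i) for i in range(len(cells))]
whole = [a + b for a, b in zip(part_a, part_b)]

point, dv = (1, 1, 1), 1.0
print(pot(whole, cells, point, dv),
      pot(part_a, cells, point, dv) + pot(part_b, cells, point, dv))
```

The two printed values agree, since the sum is linear in the density. Any other law of electrodynamic force which has the same linear character would allow the averaging to be performed before or after the operation in the same way.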

*The same would not be true of the corresponding scalar equations, (3) and (4). For 


one component of the force might depend upon all the components of acceleration. 
Such is in fact the case with the law of electromotive force proposed by Weber. 
8. Let us now examine the relation which subsists between the
values of $[E]_{\mathrm{Ave}}$ and $[U]_{\mathrm{Ave}}$ for the same point, that is, between the
average electromotive force and the average displacement in a small 
sphere with its center at the point considered. We have already seen 
that the forces and the displacements are harmonic functions of the 
time having a common period. 

A little consideration will show that if the average electromotive 
force in the sphere is given as a function of the time, the displace- 
ments in the sphere, both average and actual, must be entirely 
determined. Especially will this be evident, if we consider that since 
we have made the radius of the sphere very small in comparison with 
a wave-length, the average force must have sensibly the same value 
throughout the sphere (that is, if we vary the position of the center 
of the sphere for which the average is taken by a distance not greater 
than the radius, the value of the average will not be sensibly affected), 
and that the difference of the actual and average force at any point 
is entirely determined by the motions in the immediate vicinity of 
that point. If, then, certain oscillatory motions may be kept up in 
the sphere under the influence of electrostatic and electrodynamic 
forces due to the motion in the whole field, and if we suppose the 
motions in and very near that sphere to be unchanged, but the motions 
in the remoter parts of the field to be altered, only not so as to affect 
the average resultant of electromotive force in the sphere, the actual 
resultant of electromotive force will also be unchanged throughout 
the sphere, and therefore the motions in the sphere will still be such 
as correspond to the forces. 

Now the average displacement is a harmonic function of the time 
having a period which we suppose given. It is therefore entirely 
determined for the whole time the vibrations continue by the values 
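of the six quantities

$$[\xi]_{\mathrm{Ave}},\quad [\eta]_{\mathrm{Ave}},\quad [\zeta]_{\mathrm{Ave}},\quad [\dot{\xi}]_{\mathrm{Ave}},\quad [\dot{\eta}]_{\mathrm{Ave}},\quad [\dot{\zeta}]_{\mathrm{Ave}}$$

at any one instant. For the same reason the average electromotive
force is entirely determined for the whole time by the values of the
six quantities

$$[X]_{\mathrm{Ave}},\quad [Y]_{\mathrm{Ave}},\quad [Z]_{\mathrm{Ave}},\quad [\dot{X}]_{\mathrm{Ave}},\quad [\dot{Y}]_{\mathrm{Ave}},\quad [\dot{Z}]_{\mathrm{Ave}}$$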


for the same instant. The first six quantities will therefore be 
functions of the second, and the principle of the superposition of 
motions requires that they shall be homogeneous functions of the 
first degree. And the second six quantities will be homogeneous 
functions of the first degree of the first six. The coefficients by which 
these functions are expressed will depend upon the nature of the 
medium in the vicinity of the point considered. They will also 
depend upon the period of vibration, that is, upon the color of the
light.* 


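We may therefore write in vector notation

$$[\mathrm{E}]_{\mathrm{Ave}} = \Phi\,[\mathrm{U}]_{\mathrm{Ave}} + \Psi\,[\dot{\mathrm{U}}]_{\mathrm{Ave}}, \tag{7}$$

where $\Phi$ and $\Psi$ denote linear functions.†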

The optical properties of media are determined by the form of 
these functions. But all forms of linear functions would not be 
consistent with the principle of the conservation of energy. 

In media which are more or less opaque, and which therefore
absorb energy, $\Psi$ must be of such a form that the function always
makes an acute angle (or none) with the independent variable. In
perfectly transparent media, $\Psi$ must vanish, unless the function is
at right angles to the independent variable. So far as is known, the
last occurs only when the medium is subject to magnetic influence.
In perfectly transparent media, the principle of the conservation of
energy requires that $\Phi$ should be self-conjugate, i.e., that for three
directions at right angles to one another, the function and independent
variable should coincide in direction.
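
As a modern numerical illustration, which is not part of the original paper, the conditions just stated can be checked for 3×3 real matrices standing in for the two linear functions. The sketch assumes that the energy absorbed is measured by the scalar product of the force with the rate of displacement; the helper names `is_self_conjugate` and `dissipation_rate` and all sample matrices are hypothetical.

```python
# Illustrative sketch only: the linear vector functions are modeled as
# 3x3 real matrices (lists of rows). Names and sample values are hypothetical.

def apply(m, v):
    """Apply the linear vector function m to the vector v."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]

def dot(a, b):
    """Scalar product of two vectors."""
    return sum(x * y for x, y in zip(a, b))

def is_self_conjugate(m, tol=1e-12):
    """A real linear vector function is self-conjugate when its matrix is symmetric."""
    return all(abs(m[i][j] - m[j][i]) <= tol for i in range(3) for j in range(3))

def dissipation_rate(psi, velocity):
    """Scalar product of psi(velocity) with velocity.

    Positive: the function makes an acute angle with its variable (absorption).
    Zero: the function is at right angles to its variable (no absorption).
    """
    return dot(apply(psi, velocity), velocity)

phi = [[2.0, 0.3, 0.0], [0.3, 1.5, 0.1], [0.0, 0.1, 1.0]]            # symmetric
psi_absorbing = [[0.4, 0.0, 0.0], [0.0, 0.2, 0.0], [0.0, 0.0, 0.1]]  # opaque medium
psi_rotating = [[0.0, 0.5, 0.0], [-0.5, 0.0, 0.0], [0.0, 0.0, 0.0]]  # skew-symmetric

v = [1.0, -2.0, 0.5]
print(is_self_conjugate(phi))
print(dissipation_rate(psi_absorbing, v))
print(dissipation_rate(psi_rotating, v))
```

The skew-symmetric sample gives a force always perpendicular to the rate of displacement, so it absorbs no energy, which is the exceptional transparent case mentioned in the text.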

In all isotropic media not subject to magnetic influence, it is probable
that $\Phi$ and $\Psi$ reduce to numerical coefficients, as is certainly
the case with $\Phi$ for transparent isotropic media.

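9. Comparing the two values of $[\mathrm{E}]_{\mathrm{Ave}}$, we have

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\,[\mathrm{U}]_{\mathrm{Ave}} - \nabla [q]_{\mathrm{Ave}} = \Phi\,[\mathrm{U}]_{\mathrm{Ave}} + \Psi\,[\dot{\mathrm{U}}]_{\mathrm{Ave}}. \tag{8}$$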


This equation, in connection with that by which we express the sole- 
noidal character of the displacements, if we regard them as necessarily 
solenoidal, or in connection with that which expresses the relation 
between the electrostatic potential and the displacements, if we reject 
the solenoidal hypothesis, may be regarded as the general equation of 
the vibrations of monochromatic light, considered as oscillating elec- 
trical fluxes. For the symbol Pot, however, we must substitute the 
symbol representing the operation by which electromotive force is 
calculated from acceleration of flux, with the negative sign, if we are 
not satisfied with the law provisionally adopted. 


*The relation between the displacements in one of the small spaces considered and
the average electromotive force is mathematically analogous to the relation between the
displacements in a system of a high degree of complexity and certain forces exerted
from without, which are harmonic functions of the time and under the influence of
which the system vibrates. The ratio of the displacements to the forces will in general
vary with the period, and may vary very rapidly.

An example in which these functions vary very rapidly with the period is afforded
by the phenomena of selective absorption and abnormal dispersion.

† A vector is said to be a linear function of another, when the three components of the
first are homogeneous functions of the first degree of the three components of the second.
It is important to observe that the existence of molecular vibrations
of ponderable matter, due to the passage of light through the medium, 
will not affect the reasoning by which this equation has been estab- 
lished, provided that the nature and intensity of these vibrations in 
any small part of the medium (as measured by a wave-length) are 
entirely determined by the electrical forces and motions in that part 
of the medium. But the equation would not hold in case of molecular 
vibrations due to magnetic force. Such vibrations would constitute 
an oscillating magnetization of the medium, which has already been 
excluded from the discussion. 

The supposition which has sometimes been made,* that electricity 
possesses a certain mass or inertia, would not at all affect the validity 
of the equation. 

10. The equation may be reduced to a form in some respects more 
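simple by the use of the so-called imaginary quantities. We shall
write $\iota$ for $\sqrt{-1}$. If we differentiate with respect to the time, and
substitute $-\frac{4\pi^2}{p^2}[\mathrm{U}]_{\mathrm{Ave}}$ for $[\ddot{\mathrm{U}}]_{\mathrm{Ave}}$, we obtain

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\,[\dot{\mathrm{U}}]_{\mathrm{Ave}} - \nabla [\dot{q}]_{\mathrm{Ave}} = \Phi\,[\dot{\mathrm{U}}]_{\mathrm{Ave}} - \frac{4\pi^2}{p^2}\,\Psi\,[\mathrm{U}]_{\mathrm{Ave}}.$$

If we multiply this equation by $\iota$, either alone or in connection with
any real factor, and add it to the preceding, we shall obtain an
equation which will be equivalent to the two of which it is formed.
Multiplying by $-\iota\frac{p}{2\pi}$ and adding, we have

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\left([\mathrm{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathrm{U}}]_{\mathrm{Ave}}\right) - \nabla\left([q]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{q}]_{\mathrm{Ave}}\right) = \left(\Phi + \iota\frac{2\pi}{p}\Psi\right)\left([\mathrm{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathrm{U}}]_{\mathrm{Ave}}\right).$$

If we set

$$\mathrm{W} = [\mathrm{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathrm{U}}]_{\mathrm{Ave}}, \tag{9}$$

$$Q = [q]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{q}]_{\mathrm{Ave}}, \tag{10}$$

$$\Theta = \Phi + \iota\frac{2\pi}{p}\Psi, \tag{11}$$

our equation reduces to

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\mathrm{W} - \nabla Q = \Theta\,\mathrm{W}. \tag{12}$$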


In this equation $\Theta$ denotes a complex linear vector function, i.e., a
vector function of which the X-, Y-, and Z-components are expressed
in terms of the X-, Y-, and Z-components of the independent variable
by means of coefficients of the form $a + \iota b$. W is a bivector of which


*See Weber, Abhandl. d. K. Sächs. Gesellsch. d. Wiss., vol. vi, pp. 593-597; Lorberg,
Crelle's Journal, vol. lxi, p. 55.

the real part represents the averaged displacement $[\mathrm{U}]_{\mathrm{Ave}}$, and the
coefficient of $\iota$ the rate of increase of the same multiplied by a
constant factor. This bivector therefore represents the average
state of a small part of the field both with respect to position and
velocity. We may also say that the coefficient of $\iota$ in W represents
the value of the averaged displacement $[\mathrm{U}]_{\mathrm{Ave}}$ at a time one-quarter
of a vibration earlier than the time principally considered.

11. It may serve to fix our ideas to see how W is expressed as a 
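function of the time. We may evidently set

$$[\mathrm{U}]_{\mathrm{Ave}} = \mathrm{A}_1\cos\frac{2\pi}{p}t + \mathrm{A}_2\sin\frac{2\pi}{p}t,$$

where $\mathrm{A}_1$ and $\mathrm{A}_2$ are vectors representing the amplitudes of the two
parts into which the vibration is resolved. Then

$$\frac{p}{2\pi}[\dot{\mathrm{U}}]_{\mathrm{Ave}} = -\mathrm{A}_1\sin\frac{2\pi}{p}t + \mathrm{A}_2\cos\frac{2\pi}{p}t,$$

and

$$[\mathrm{U}]_{\mathrm{Ave}} - \iota\frac{p}{2\pi}[\dot{\mathrm{U}}]_{\mathrm{Ave}} = (\mathrm{A}_1 - \iota\mathrm{A}_2)\left(\cos\frac{2\pi}{p}t + \iota\sin\frac{2\pi}{p}t\right);$$

that is, if we set $\mathrm{A} = \mathrm{A}_1 - \iota\mathrm{A}_2$,

$$\mathrm{W} = \mathrm{A}\,e^{\iota\frac{2\pi}{p}t}. \tag{13}$$

In like manner we may obtain

$$Q = g\,e^{\iota\frac{2\pi}{p}t}, \tag{14}$$

where g is a biscalar, or complex quantity of ordinary algebra.
Substituting these values in (12), and cancelling the common factor
containing the time, we have

$$\frac{4\pi^2}{p^2}\operatorname{Pot}\mathrm{A} - \nabla g = \Theta\,\mathrm{A}. \tag{15}$$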
Our equation is thus reduced to one between A and g, and may easily 
be reduced to one in A alone.* Now A represents six numerical 
quantities (viz. the three components of $\mathrm{A}_1$, and the three of $\mathrm{A}_2$),
which may be called the six components of amplitude. The equation,
therefore, substantially represents the relations between the six
components of amplitude in different parts of the field.† The equation
is, however, not really different from (12), since A and g are only 
particular values of W and Q. 
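
The passage from the real harmonic motion to the bivector can be checked numerically. The following is a minimal sketch, not part of the original paper: it assumes an averaged displacement of the form A1 cos(2πt/p) + A2 sin(2πt/p), forms the bivector as the displacement minus ι(p/2π) times its rate of increase, and compares it with (A1 − ιA2) multiplied by the exponential time factor. The function names and the sample amplitudes are hypothetical.

```python
import cmath
import math

def displacement(a1, a2, p, t):
    """Averaged displacement A1 cos(2 pi t / p) + A2 sin(2 pi t / p)."""
    w = 2.0 * math.pi / p
    return [x * math.cos(w * t) + y * math.sin(w * t) for x, y in zip(a1, a2)]

def velocity(a1, a2, p, t):
    """Time derivative of the averaged displacement."""
    w = 2.0 * math.pi / p
    return [w * (-x * math.sin(w * t) + y * math.cos(w * t)) for x, y in zip(a1, a2)]

def bivector_from_motion(a1, a2, p, t):
    """W = U - i (p / 2 pi) dU/dt, built from the real motion."""
    u = displacement(a1, a2, p, t)
    v = velocity(a1, a2, p, t)
    return [complex(ui, -(p / (2.0 * math.pi)) * vi) for ui, vi in zip(u, v)]

def bivector_from_amplitude(a1, a2, p, t):
    """W = (A1 - i A2) exp(i 2 pi t / p), built from the complex amplitude."""
    phase = cmath.exp(1j * 2.0 * math.pi * t / p)
    return [complex(x, -y) * phase for x, y in zip(a1, a2)]

# Hypothetical amplitudes, period and instant.
a1 = [1.0, 0.0, 0.5]
a2 = [0.0, 2.0, -1.0]
p, t = 3.0, 0.7

w_motion = bivector_from_motion(a1, a2, p, t)
w_amplitude = bivector_from_amplitude(a1, a2, p, t)
for m, a in zip(w_motion, w_amplitude):
    print(m, a)
```

The two constructions agree component by component, and the real part of either is the averaged displacement itself.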


*The terms $\nabla Q$, $\nabla q$ are allowed to remain in these equations, because the best
manner of eliminating them will depend somewhat upon our admission or rejection of
the solenoidal hypothesis.

†The representation of the six components of amplitude by a single letter should not
be regarded as an analytical artifice. It only leaves undivided in our notation that
which is undivided in the nature of things. The separation of the six components of
amplitude is artificial, in that it introduces arbitrary elements into the discussion, viz.
the directions of the axes of the coordinates, and the zero of time.
12. From the general equation given above (8, 12, or 15), in connection
with the solenoidal hypothesis, we may easily derive the laws
of the propagation of plane waves in the interior of a sensibly homogeneous
medium, and the laws of reflection and refraction at surfaces
between such media. This has been done by Maxwell,* Lorentz,† and
others,‡ with fundamental equations more or less similar.

The method, however, by which the fundamental equation has 
been established in this paper seems free from certain objections 
which have been brought against the ordinary form of the theory. 
As ordinarily treated, the phenomena are made to depend entirely 
on the inductive capacity and the conductivity of the medium, in a 
manner which may be expressed by the equation 


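$$[\dot{\mathrm{U}}]_{\mathrm{Ave}} = \left(\frac{K}{4\pi}\frac{d}{dt} + C\right)\left(\frac{4\pi^2}{p^2}\operatorname{Pot}\,[\mathrm{U}]_{\mathrm{Ave}} - \nabla [q]_{\mathrm{Ave}}\right), \tag{16}$$

where K and C denote in the most general case the linear vector
functions, but in isotropic bodies the numerical coefficients, which
represent inductive capacity and conductivity. By a simple
transformation {see (9) and (10)}, this equation becomes

$$\mathrm{W} = \left(\frac{K}{4\pi} - \iota\frac{pC}{2\pi}\right)\left(\frac{4\pi^2}{p^2}\operatorname{Pot}\mathrm{W} - \nabla Q\right), \tag{17}$$

which will be equivalent to (12), if

$$\Theta^{-1} = \frac{K}{4\pi} - \iota\frac{pC}{2\pi}, \tag{18}$$

where $\Theta^{-1}$ represents the function inverse to $\Theta$.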

Now, while experiment appears to verify the existence of such a
law as is expressed by equation (12), it does not show that $\Theta$ has the
precise form indicated by equation (16). In other words, experiment
does not satisfactorily verify the relations expressed by (16) and (17),
if K and C are understood to be the operators (or, in isotropic bodies,
the numbers) which represent inductive capacity and conductivity in
the ordinary sense of the terms.

The discrepancy is most easily shown in the most simple case, when
the medium is isotropic and perfectly transparent, and $\Theta$ reduces to a
numerical quantity. The square of the velocity of plane waves is
then equal to $\frac{\Theta}{4\pi}$, and equation (18) would make it independent of the


* Phil. Trans., vol. clv (1865), p. 459, or Treatise on Electricity and Magnetism,
chap. xx.

†Schlömilch's Zeitschrift, vol. xxii, pp. 1-30 and 205-219; xxiii, pp. 197-210.

‡See Fitzgerald, Phil. Trans., vol. clxxi, p. 691; J. J. Thomson, Phil. Mag.
(5), vol. ix, p. 284; Rayleigh, Phil. Mag. (5), vol. xii, p. 81.

That the electromagnetic theory of light gives the conditions relative to the boundary
of different media, which are required by the phenomena of reflection and refraction,
was first shown by Helmholtz. See Crelle's Journal, vol. lxxii (1870), p. 57.

period; that is, would give no dispersion of colors. The case is
essentially the same in transparent bodies which are not isotropic.* 
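
The absence of dispersion in the ordinary form of the theory can be illustrated numerically. The following is a minimal sketch, not part of the original paper, for an isotropic medium: it assumes that the inverse of the numerical quantity Θ is K/4π − ιpC/2π, with K the inductive capacity and C the conductivity, and that the square of the velocity of plane waves in a transparent medium is Θ/4π. The function names and sample values are hypothetical.

```python
import math

def theta(K, C, p):
    """Theta for an isotropic medium, assuming 1/Theta = K/(4 pi) - i p C/(2 pi)."""
    return 1.0 / complex(K / (4.0 * math.pi), -p * C / (2.0 * math.pi))

def velocity_sq_transparent(K, p):
    """Square of the plane-wave velocity, Theta / (4 pi), when C = 0."""
    return (theta(K, 0.0, p) / (4.0 * math.pi)).real

K = 2.25  # hypothetical inductive capacity
for p in (0.5, 1.0, 7.0):
    # The period enters theta only through the conductivity term,
    # so with C = 0 the printed value is the same for every period.
    print(p, velocity_sq_transparent(K, p))
```

Under these assumptions the result reduces to 1/K for every period, which is the independence of the period that the text objects to.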

The case is worse with metals, which are characterized electrically 
by great conductivity, and optically by great opacity. In their 
papers cited above, Lorentz and Rayleigh have observed that the 
experiments of Jamin on the reflection of light from metallic surfaces 
would often require, as ordinarily interpreted on the electromagnetic 
theory, a negative value for the inductive capacity of the metal. 
This would imply that the electrical equilibrium in the metal is 
unstable. The objection, therefore, is essentially the same as that 
which Lord Rayleigh had previously made to Cauchy’s theory of 
metallic reflection, viz., that the apparent mechanical explanation 
of the phenomena is illusory, since the numerical values given by 
experiment as interpreted on Cauchy’s theory would involve an 
unstable equilibrium of the ether in the metal.†

13. All this points to the same conclusion—that the ordinary view 
of the phenomena is inadequate. The object of this paper will be 
accomplished, if it has been made clear how a point of view more in 
accordance with what we know of the molecular constitution of 
bodies will give that part of the ordinary theory which is verified by 
experiment, without including that part which is in opposition to 
observed facts. 


* See note to the first paper of Lorentz, cited above, Schlömilch, vol. xxii, p. 23.

†See Phil. Mag. (4), vol. xliii (1872), p. 321.

‡The consideration of the processes which we may suppose to take place in the
smallest parts of a body through which light is transmitted, farther than is necessary to
establish the general equation given above, is foreign to the design of this paper. Yet a
word may be added with respect to the difficulties signalized in the ordinary form of the
theory. The comparatively simple case of a perfectly transparent body has been
examined more in detail in one of the papers already cited, where there is given an
explanation of the dispersion of colors from the point of view of this paper. It is there
shown that the effect of the non-homogeneity of the body in its smallest parts is to add
a term to the expression for the kinetic energy of electrical waves, which for an isotropic
body may be roughly described as similar to that which would be required if the
electricity had a certain mass or inertia. (See especially §§ 7, 9 and 12, [this volume
pages 185 ff.]) The same must be true of media of any degree of opacity. Now the
difficulty with the optical properties of the metals is that the real part of $\Theta$ (or $\Theta^{-1}$)
is in some cases negative. This implies that at a moment of greatest displacement
the electromotive force is in the direction opposite to the displacement, instead of
having the same direction, as in transparent isotropic bodies. Now a certain part of
the electromotive force must be required to oppose the apparent inertia, and another
part to oppose the electrical elasticity of the medium. These parts of the force must
have opposite directions. In transparent bodies the latter part is by far the greater. But
it need not surprise us that the former should be the greater in some metals.

It has been remarked by Lorentz that the difficulty with respect to metals would be
in a measure relieved if we should suppose electricity to have the property of inertia.
(See § 11 of his third paper, Schlömilch's Zeitschrift, vol. xxiii, p. 208.) But a supposition
of this kind, taken literally, would involve a dispersion of colors in vacuo, and still
be inadequate, as Lorentz remarks, to explain the phenomena observed in metals.
While the writer has aimed at a greater degree of rigor than is
usual in the establishment of the fundamental equation of mono- 
chromatic light, it is not claimed that this equation is absolutely 
exact. The contrary is evident from the fact that the equation 
does not embrace the phenomena which characterize such circularly 
polarizing bodies as quartz. This, however, only implies the neglect 
of extremely small quantities—very small, for example, as compared 
with those which determine the dispersion of colors. In one of the 
papers already cited,* the case of a perfectly transparent body is 
treated with a higher degree of approximation, so as to embrace the 
phenomena in question. 


* See page 195 of this volume. 




A COMPARISON OF THE ELASTIC AND THE ELECTRICAL 
THEORIES OF LIGHT WITH RESPECT TO THE LAW OF 
DOUBLE REFRACTION AND THE DISPERSION OF COLORS. 


[American Journal of Science, ser. 3, vol. xxxv, pp. 467-475, June, 1888.]


It is claimed for the electrical * theory of light that it is free from
serious difficulties, which beset the explanation of the phenomena of 
light by the dynamics of elastic solids. Just what these difficulties 
are, and why they do not occur in the explanation of the same 
phenomena by the dynamics of electricity, has not perhaps been 
shown with all the simplicity and generality which might be desired. 
Such a treatment of the subject is however the more necessary on 
account of the ever-increasing bulk of the literature on either side, 
and the confusing multiplicity of the elastic theories. It is the object 
of this paper to supply this want, so far as respects the propagation 
of plane waves in transparent and sensibly homogeneous media. The 
simplicity of this part of the subject renders it appropriate for the 
first test of any optical theory, while the precision of which the 
experimental determinations are capable, renders the test extremely 
rigorous. 

It is moreover, as the writer believes, an appropriate time for the 
discussion proposed, since on one hand the experimental verification 
of Fresnel’s Law has recently been carried to a degree of precision far 
exceeding anything which we have had before,† and on the other, the


*The term electrical seems the most simple and appropriate to describe that theory of
light which makes it consist in electrical motions. The cases in which any distinctively
magnetic action is involved in the phenomena of light are so exceptional, that it is
difficult to see any sufficient reason why the general theory should be called
electromagnetic, unless we are to call all phenomena electromagnetic which depend on the
motions of electricity.

†In the recent experiments of Professor Hastings relating to the index of refraction of
the extraordinary ray in Iceland spar for the spectral line D, and a wave-normal inclined
at about 31° to the optic axis, the difference between the observed and the calculated
values was only two or three units in the sixth decimal place (in the seventh significant
figure), which was about the probable error of the determinations. See Am. Jour. Sci.
ser. 3, vol. xxxv, p. 60.

discovery of a remarkable theorem relating to the vibrations of a
strained solid* has given a new impulse to the study of the elastic 
theory of light. 

Let us first consider the facts to which a correct theory must
conform.

It is generally admitted that the phenomena of light consist 
in motions (of the type which we call wave-motions) of some- 
thing which exists both in space void of ponderable matter, and 
in the spaces between the molecules of bodies, perhaps also in 
the molecules themselves. The kinematics of these motions is 
pretty well understood; the question at issue is whether it agrees 
with the dynamics of elastic solids or with the dynamics of 
electricity. 

In the case of a simple harmonic wave-motion, which alone we need 
consider, the wave-velocity (V) is the quotient of the wave-length (l)
by the period of vibration (p). These quantities can be determined 
with extreme accuracy. In media which are sensibly homogeneous 
but not isotropic the wave-velocity V, for any constant value of the 
period, is a quadratic function of the direction cosines of a certain 
line, viz., the normal to the so-called “plane of polarization.” The 
physical characteristics of this line have been a matter of dispute. 
Fresnel considered it to be the direction of displacement. Others 
have maintained that it is the common perpendicular to the wave- 
normal and the displacement. Others again would define it as 
that component of the displacement which is perpendicular to the 
wave-normal. This of course would differ from Fresnel’s view only 
in case the displacements are not perpendicular to the wave-normal, 
and would in that case be a necessary modification of his view. 
Although this dispute has been one of the most celebrated in 
physics, it seems to be at length substantially settled, most directly 
by experiments upon the scattering of light by small particles, 
which seems to show decisively that in isotropic media at least 
the displacements are normal to the “plane of polarization,” and
also, with hardly less cogency, by the difficulty of accounting 
for the intensities of reflected and refracted light on any other 


*Sir Wm. Thomson has shown that if an elastic incompressible solid in which the 
potential energy of any homogeneous strain is proportional to the sum of the squares of 
the reciprocals of the principal elongations minus three is subjected to any homogeneous 
strain by forces applied to its surface, the transmission of plane waves of distortion, 
superposed on this homogeneous strain, will follow exactly Fresnel’s law (including the 
direction of displacement), the three principal velocities being proportional to the 
reciprocals of the principal elongations. It must be a surprise to mathematicians and 
physicists to learn that a theorem of such simplicity and beauty has been waiting to be 
discovered in a field which has been so carefully gleaned. See page 116 of the current 
volume (xxv) of the Philosophical Magazine. 
supposition.* It should be added that all diversity of opinion on
this subject has been confined to those whose theories are based on 
the dynamics of elastic bodies. Defenders of the electrical theory 
have always placed the electrical displacement at right angles to the 
“plane of polarization.” It will, however, be better to assume this 
direction of the displacement as probable rather than as absolutely 
certain, not so much because many are likely to entertain serious 
doubts on the subject, as in order not to exclude views which have 
at least a historical interest. 

The wave-velocity, then, for any constant period, is a quadratic 
function of the cosines of a certain direction, which is probably that 
of the displacement, but in any case determined by the displacement 
and the wave-normal. The coefficients of this quadratic function are 
functions of the period of vibration. It is important to notice that 
these coefficients vary separately, and often quite differently, with the 
period, and that the case does not at all resemble that of a quadratic 
function of the direction-cosines multiplied by a quantity depending 
on the period. 

In discussing the dynamics of the subject we may gain something 
in simplicity by considering a system of stationary waves, such as 
results from two similar systems of progressive waves moving in 
opposite directions. In such a system the energy is alternately 
entirely kinetic and entirely potential. Since the total energy is 
constant, we may set the average kinetic energy per unit of volume 
at the moment when there is no potential energy, equal to the average 
potential energy per unit of volume when there is no kinetic energy. 
We may call this the equation of energies. It will contain the 
quantities l and p, and thus furnish an expression for the velocity
of either system of progressive waves. We have to see whether the 
elastic or the electric theory gives the expression most conformed to 
the facts. 

Let us first apply the elastic theory to the case of the so-called 


* “At the same time, if the above reasoning be valid, the question as to the direction
of the vibrations in polarized light is decided in accordance with the view of Fresnel.
I confess I cannot see any room for doubt as to the result it leads to. ... I only
mean that if light, as is generally supposed, consists of transversal vibrations similar to
those which take place in an elastic solid, the vibration must be normal to the plane of
polarization.” Lord Rayleigh, “On the Light from the Sky, its Polarization and Color;”
Phil. Mag. (4), xli (1871), p. 109.

“Green's dynamics of polarization by reflexion, and Stokes' dynamics of the diffraction
of polarized light, and Stokes' and Rayleigh's dynamics of the blue sky, all agree in, as
it seems to me, irrefragably, demonstrating Fresnel's original conclusion, that in plane
polarized light the line of vibration is perpendicular to the plane of polarization.”
Sir Wm. Thomson, loc. citat.

†The terms kinetic energy and potential energy will be used in this paper to denote
these average values.

vacuum. If we write h for the amplitude measured in the middle
between two nodal planes, the velocities of displacement will be as
$\frac{h}{p}$, and the kinetic energy will be represented by $A\frac{h^2}{p^2}$, where A is a
constant depending on the density of the medium. The potential
energy, which consists in distortion of the medium, may be represented
by $B\frac{h^2}{l^2}$, where B is a constant depending on the rigidity of the
medium. The equation of energies, on the elastic theory, is therefore

$$A\frac{h^2}{p^2} = B\frac{h^2}{l^2}, \tag{1}$$

which gives

$$V^2 = \frac{l^2}{p^2} = \frac{B}{A}. \tag{2}$$


In the electrical theory, the kinetic energy is not determined by the 
simple formula of ordinary dynamics from the square of the velocity 
of each element, but is found by integrating the product of the 
velocities of each pair of elements divided by the distance between 
them. Very elementary considerations suffice to show that a quantity 
thus determined when estimated per unit of volume will vary as the
square of the wave-length. We may therefore set $F\frac{l^2h^2}{p^2}$ for the kinetic
energy, F being a constant. The potential energy does not consist
in distortion of the medium, but depends upon an elastic resistance
to the separation of the electricities, which constitutes the electrical
displacement, and is proportioned to the square of this displacement.
The average value of the potential energy per unit of volume will
therefore be represented in the electrical theory by $Gh^2$, where G is a
constant, and the equation of energies will be

$$F\frac{l^2h^2}{p^2} = Gh^2, \tag{3}$$

which gives

$$V^2 = \frac{l^2}{p^2} = \frac{G}{F}. \tag{4}$$


Both theories give a constant velocity, as is required. But it is 
instructive to notice the profound difference in the equations of energy 
from which this result is derived. In the elastic theory the square of 
the wave-length appears in the potential energy as a divisor; in the 
electrical theory it appears in the kinetic energy as a factor. 
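
The contrast between the two equations of energy can be checked numerically. The following is a minimal sketch, not part of the original paper: it takes the elastic energies as A h²/p² and B h²/l², and the electrical energies as F l² h²/p² and G h², as described above. The function names and the sample constants are hypothetical.

```python
import math

def elastic_energies(A, B, h, l, p):
    """Kinetic and potential energy on the elastic theory (vacuum)."""
    return A * h**2 / p**2, B * h**2 / l**2

def electric_energies(F, G, h, l, p):
    """Kinetic and potential energy on the electrical theory (vacuum)."""
    return F * l**2 * h**2 / p**2, G * h**2

def wavelength(velocity_sq, p):
    """Wave-length l = V p for a given squared velocity and period."""
    return math.sqrt(velocity_sq) * p

# Hypothetical constants, chosen so that both theories give V^2 = 4.
A, B = 2.0, 8.0
F, G = 0.5, 2.0
h = 0.01

for p in (1.0, 2.5, 10.0):
    l_elastic = wavelength(B / A, p)
    l_electric = wavelength(G / F, p)
    # In each theory the kinetic and potential energies balance for every period.
    print(p, elastic_energies(A, B, h, l_elastic, p), electric_energies(F, G, h, l_electric, p))
```

In the elastic case the wave-length enters only the potential energy, as a divisor; in the electrical case it enters only the kinetic energy, as a factor, yet both balances give a velocity that does not depend on the period.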

Let us now consider how these equations will be modified by the 
presence of ponderable matter, in the most general case of transparent 
and sensibly homogeneous bodies. This subject is rendered much 
more simple by the fact that the distances between the ponderable 
molecules are very small compared with a wave-length. Or, what
amounts to the same thing, but may present a more distinct picture
to the imagination, the wave-length may be regarded as enormously 
great in comparison with the distances between neighboring molecules. 
Whatever view we take of the motions which constitute light, we can 
hardly suppose them (disturbed as they are by the presence of the 
ponderable molecules) to be in strictness represented by the equations 
of wave-motion. Yet in a certain sense a wave-motion may and does 
exist. If, namely, instead of the actual displacement at any point, 
we consider the average displacement in a space large enough to 
contain an immense number of molecules, and yet small as measured 
by a wave-length, such average displacements may be represented 
by the equations of wave-motion; and it is only in this sense that 
any theory of wave-motion can apply to the phenomena of light in 
transparent bodies. When we speak of displacements, amplitudes, 
velocities (of displacement), etc., it must therefore be understood in 
this way. 

The actual kinetic energy, on either theory, will evidently be 
greater than that due to the motion thus averaged or smoothed, and 
to a degree presumably depending on the direction of the displace- 
ment. But since displacement in any direction may be regarded as 
compounded of displacements in three fixed directions, the additional 
energy will be a quadratic function of the components of velocity of 
displacement, or, in other words, a quadratic function of the direction- 
cosines of the displacement multiplied by the square of the amplitude 
and divided by the square of the period.* This additional energy 
may be understood as including any part of the kinetic energy of 
the wave-motion which may belong to the ponderable particles. The 
term to be added to the kinetic energy on the electric theory may
therefore be written $f_D\frac{h^2}{p^2}$, where $f_D$ is a quadratic function of the
direction-cosines of the displacement. The elastic theory requires a
term of precisely the same character, but since the term to which it
is to be added is of the same general form, the two may be incorporated
in a single term of the form $A_D\frac{h^2}{p^2}$, where $A_D$ is a quadratic
function of the direction-cosines of the displacement. We must,
however, notice that both $A_D$ and $f_D$ are not entirely independent of
the period. For the manner in which the flux of the luminiferous
medium is distributed among the ponderable molecules will naturally
depend somewhat upon the period. The same is true of the degree
to which the molecules may be thrown into vibration. But $A_D$ and
$f_D$ will be independent of the wave-length (except so far as this is


*For proof in extenso of this proposition, when the motions are supposed electrical,
the reader is referred to page 187 of this volume.

connected with the period), because the wave-length is enormously
great compared with the size of the molecules and the distances 
between them. 

The potential energy on the elastic theory must be increased by
a term of the form $b_D h^2$, where $b_D$ is a quadratic function of the
direction-cosines of the displacement. For the ponderable particles
must oppose a certain elastic resistance to the displacement of the
ether, which in æolotropic bodies will presumably be different in
different directions. The potential energy on the electric theory will
be represented by a single term of the same form, say $G_D h^2$, where
a quadratic function of the direction-cosines of the displacement, $G_D$,
takes the place of the constant G, which was sufficient when the
ponderable particles were absent. Both $G_D$ and $b_D$ will vary to some
extent with the period, like $A_D$ and $f_D$, and for the same reason.

In regard to that potential energy, which on the elastic theory is 
independent of the direct action of the ponderable molecules, it has 
been supposed that in aeolotropic bodies the effect of the molecules is 
such as to produce an aeolotropic state in the ether, so that the energy 
of a distortion varies with its orientation. This part of the potential 
energy will then be represented by $B_{ND}\,h^2/l^2$, where $B_{ND}$ is a function of 
the directions of the wave-normal and the displacement. It may 
easily be shown that it is a quadratic function both of the direction-cosines 
of the wave-normal and of those of the displacement. Also, 
that if the ether in the body when undisturbed is not in a state of 
stress due to forces at the surface of the body, or if its stress is 
uniform in all directions, like a hydrostatic pressure, the function $B_{ND}$ 
must be symmetrical with respect to the two sets of direction-cosines. 
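The equation of energies for the elastic theory is therefore 

$$\frac{A_D h^2}{p^2} = \frac{B_{ND} h^2}{l^2} + b_D h^2, \tag{5}$$

which gives 

$$V^2 = \frac{l^2}{p^2} = \frac{B_{ND}}{A_D - b_D p^2}. \tag{6}$$

The equation of energies for the electrical theory is 

$$\frac{F l^2 h^2}{p^2} + \frac{f_D h^2}{p^2} = G_D h^2, \tag{7}$$

which gives 

$$V^2 = \frac{l^2}{p^2} = \frac{G_D}{F} - \frac{f_D}{F p^2}. \tag{8}$$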


It is evident at once that the electrical theory gives exactly the 
form that we want. For any constant period the square of the 
wave-velocity is a quadratic function of the direction-cosines of 
the displacement. When the period varies, this function varies, 
the different coefficients in the function varying separately, because 
$G_D$ and $f_D$ will not in general be similar functions.* If we consider a 
constant direction of displacement while the period varies, $G_D$ and 
$f_D$ will only vary so far as the type of the motion varies, i.e., so far 
as the manner in which the flux distributes itself among the 
ponderable molecules and intermolecular spaces, and the extent to 
which the molecules take part in the motion are changed. There 
are cases in which these vary rapidly with the period, viz., cases 
of selective absorption and abnormal dispersion. But we may fairly 
expect that there will be many cases in which the character of the 
motion in these respects will not vary much with the period. 
$G_D/F$ and $f_D/F$ will then be sensibly constant and we have an approximate 
expression for the general law of dispersion, which agrees remarkably 
well with experiment.† 
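
The approximate law can be made concrete with a short numerical sketch. It assumes the electrical result in the form $V^2 = G_D/F - f_D/(Fp^2)$; dividing by the square of the velocity of light in vacuo, and writing the wave-length in vacuo for the product of that velocity and the period, gives $1/n^2 = a - b/\lambda^2$. The function name and the constants `A_CONST` and `B_CONST` below are hypothetical illustrations, not values taken from the text or from measurement.

```python
import math

def index_of_refraction(wavelength, a, b):
    """Index n from the approximate law 1/n**2 = a - b / wavelength**2.

    a stands for G_D / (F k**2) and b for f_D / F, with k the velocity
    of light in vacuo; both are treated as constants (an assumption).
    """
    inv_n_sq = a - b / wavelength**2
    if inv_n_sq <= 0:
        raise ValueError("formula gives no real index at this wave-length")
    return 1.0 / math.sqrt(inv_n_sq)

# Hypothetical constants; wave-lengths in micrometres.
A_CONST, B_CONST = 0.45, 0.004
for lam in (0.40, 0.55, 0.70):
    print(lam, round(index_of_refraction(lam, A_CONST, B_CONST), 4))
```

With a positive `b` the index falls as the wave-length grows, which is the ordinary course of dispersion in transparent bodies.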

If we now return to the equation of energies obtained from the 
elastic theory, we see at once that it does not suggest any such 
relation as experiment has indicated, either between the wave-velocity 
and the direction of displacement, or between the wave-velocity and 
the period. It remains to be seen whether it can be brought to 
agree with experiment by any hypothesis not too violent. 

In order that $V^2$ may be a quadratic function of any set of 
direction-cosines, it is necessary that $A_D$ and $b_D$ shall be independent 
of the direction of the displacement, in other words, in the case of 
a crystal like Iceland spar, that the direct action of the ponderable 
molecules upon the ether shall affect both the kinetic and the 
potential energy in the same way, whether the displacement take 
place in the direction of the optic axis or at right angles to it. 
This is contrary to everything which we should expect. If, nevertheless, 
we make this supposition, it remains to consider $B_{ND}$. This 
must be a quadratic function of a certain direction, which is almost 
certainly that of the displacement. If the medium is free from 
external stress (other than hydrostatic), $B_{ND}$, as we have seen, is 
symmetrical with respect to the wave-normal and the direction 
of displacement, and a quadratic function of the direction-cosines 
of each. The only single direction of which it can be a function is 
the common perpendicular to these two directions. If the wave- 
normal and the displacement are perpendicular, the direction-cosines 


*But $G_D$, $f_D$, and $V^2$, considered as functions of the direction of displacement, are all 
subject to any law of symmetry which may belong to the structure of the body 
considered. The resulting optical characteristics of the different crystallographic 
systems are given on pages 192-194. 

†This will appear most distinctly if we consider that $V$ divided by the velocity of 
light in vacuo gives the reciprocal of the index of refraction, and $p$ multiplied by the 
same quantity gives the wave-length in vacuo. 

of the common perpendicular to both will be linear functions of the 
direction-cosines of each, and a quadratic function of the direction- 
cosines of the common perpendicular will be a quadratic function of 
the direction-cosines of each. We may thus reconcile the theory 
with the law of double refraction, in a certain sense, by supposing 
that $A_D$ and $b_D$ are independent of the direction of displacement, 
and that $B_{ND}$ and therefore $V^2$ is a quadratic function of the direction-cosines 
of the common perpendicular to the wave-normal and the 
displacement. But this supposition, besides its intrinsic improbability 
so far as $A_D$ and $b_D$ are concerned, involves a direction of the displacement 
which is certainly or almost certainly wrong. 

We are thus driven to suppose that the undisturbed medium is in 
a state of stress, which, moreover, is not a simple hydraulic stress. 
In this case, by attributing certain definite physical properties to 
the medium, we may make the function $B_{ND}$ become independent of 
the direction of the wave-normal, and reduce to a quadratic function 
of the direction-cosines of the displacement.* This entirely satisfies 
Fresnel’s Law, including the direction of displacement, if we can 
suppose $A_D$ and $b_D$ independent of the direction of displacement. 
But this supposition, in any case difficult for aeolotropic bodies, 
seems quite irreconcilable with that of a permanent (not hydrostatic) 
stress. 

For this stress can only be kept up by the action of the ponderable 
molecules, and by a sort of action which hinders the passage of the 
ether past the molecules. Now the phenomena of reflection and 
refraction would be very different from what they are, if the optical 
homogeneity of a crystal did not extend up very close to the surface. 
This implies that the stress is produced by the ponderable particles 
in a very thin lamina at the surface of the crystal, much less in 
thickness, it would seem probable, than a wave-length of yellow 
light. And this again implies that the power of the ponderable 
particles to pin down the ether, as it were, to a particular position is 
very great, and that the term in the energy relating to the motion 
of the ether relative to the ponderable particles is very important. 
This is the term containing the factor $b_D$, which it is difficult to 
suppose independent of the direction of displacement because the 
dimensions and arrangement of the particles are different in different 
directions. But our present hypothesis has brought in a new 
reason for supposing $b_D$ to depend on the direction of displacement, 
viz., on account of the stress of the medium. A general displacement 
of the medium midway between two nodal planes, when it is 
restrained at innumerable points by the ponderable particles, will 


*See note on page 224. 

produce special distortions due to these particles. The nature of 
these distortions is wholly determined by the direction of displace- 
ment, and it is hard to conceive of any reason why the energy of 
these distortions should not vary with the direction of displacement, 
like the energy of the general distortion of the wave-motion, which 
is partly determined by the displacement and partly by the wave- 
normal.* 

But the difficulties of the elastic theory do not end with the 
law of double refraction, although they are there more conspicuous 
on account of the definite and simple law by which they can be 
judged. It does not easily appear how the equation of energies 
can be made to give anything like the proper law of the dispersion 
of colors. Since for given directions of the wave-normal and displacement, 
or in an isotropic body, $B_{ND}$ is constant, and also $A_D$ and $b_D$, 
except so far as the type of the vibration varies, the formula requires 
that the square of the index of refraction (which is inversely as $V^2$) 
should be equal to a constant diminished by a term proportional to 
the square of the period, except so far as this law is modified by a 
variation of the type of vibration. But experiment shows nothing 
like this law. Now the variation in the type of vibration is sometimes 
very important,—it plays the leading role in the phenomena 
of selective absorption and abnormal dispersion,—but this is certainly 
not always the case. It seems hardly possible to suppose that the 
type of vibration is always so variable as entirely to mask the law 
which is indicated by the formula when $A_D$ and $b_D$ (with $B_{ND}$) are 
regarded as constant. This is especially evident when we consider 
that the effect on the wave-velocity of a small variation in the type of 
vibration will be a small quantity of the second order.† 
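
The law which the elastic formula requires can be written out explicitly. The step below is a sketch that assumes the elastic result $V^2 = B_{ND}/(A_D - b_D p^2)$ and introduces two symbols not used at this point of the paper: $k$ for the velocity of light in vacuo and $n$ for the index of refraction.

```latex
n^2 = \frac{k^2}{V^2}
    = \frac{k^2 A_D}{B_{ND}} - \frac{k^2 b_D}{B_{ND}}\, p^2 .
```

With $A_D$, $b_D$ and $B_{ND}$ regarded as constant, this is a constant diminished by a term proportional to $p^2$, the form which experiment does not show.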

The phenomena of dispersion, therefore, corroborate the conclusion 
which seemed to follow inevitably from the law of double refraction 
alone. 


*The reader may perhaps ask how the above reasoning is to be reconciled with the 
fact that the law of double refraction has been so often deduced from the elastic theory. 
The troublesome terms are $b_D$ and the variable part of $A_D$, which express the direct 
action of the ponderable molecules on the ether. So far as the (quite limited) reading 
and recollection of the present writer extend, those who have sought to derive the law 
of double refraction from the theory of elastic solids have generally either neglected this 
direct action—a neglect to which Professor Stokes calls attention more than once in his 
celebrated “Report on Double Refraction” (Brit. Assoc., 1862, pp. 264, 268)—or taking 
account of this action they have made shipwreck upon a law different from Fresnel’s and 
contradicted by experiment. 

†See pages 190, 191 of this volume, or Lord Rayleigh’s Theory of Sound, vol. i, 
p. 84. 


A COMPARISON OF THE ELECTRIC THEORY OF LIGHT 
AND SIR WILLIAM THOMSON’S THEORY OF A QUASI-LABILE 
ETHER. 


[American Journal of Science, ser. 3, vol. XXXVII, pp. 139-144, February, 1889.] 


A REMARKABLE paper by Sir William Thomson, in the November 
number of the Philosophical Magazine, has opened a new vista in 
the possibilities of the theory of an elastic ether. Since the general 
theory of elasticity gives three waves characterized by different 
directions of displacement for a single wave-plane, while the pheno- 
mena of optics show but two, the first point in accommodating any 
theory to observation, is to get rid (absolutely or sensibly) of the 
third wave. For this end, it has been common to make the ether 
incompressible, or, as it is sometimes expressed, to make the velocity 
of the third wave infinite. The velocity of the wave of compression 
becomes in fact infinite as the compressibility vanishes. Of course 
it has not escaped the notice of physicists that we may also get 
rid of the third wave by making its velocity zero, as may be done 
by giving certain values to the constants which express the elastic 
properties of the medium, but such values have appeared impossible, 
as involving an unstable state of the medium. The condition of 
incompressibility, absolute or approximate, has therefore appeared 
necessary.* This question of instability has now, however, been 
subjected to a more searching examination, with the result that 
the instability does not really exist “provided we either suppose the 
medium to extend all through boundless space, or give it a fixed 
containing vessel as its boundary.” This renders possible a very 
simple theory of light, which has been shown to give Fresnel’s laws 
for the intensities of reflected and refracted light and for double 
refraction, so far as concerns the phenomena which can be directly 
observed. The displacement in an aeolotropic medium is in the same 
plane passing through the wave-normal as was supposed by Fresnel, 


*It was under this impression that the paper entitled “A Comparison of the Elastic 
and the Electric Theories of Light with respect to the Law of Double Refraction and the 
Dispersion of Colors,” [this volume pp. 223-231], was written. The conclusions of 
that paper, except so far as respects the dispersion of colors, will not apply to the 
new theory. 

but its position in that plane is different, being perpendicular to the 
ray instead of to the wave-normal.* 

It is the object of this paper to compare this new theory with 
the electric theory of light. In the limiting cases, that is, when we 
regard the velocity of the missing wave in the elastic theory as zero, 
and in the electric theory as infinite, we shall find a remarkable 
correspondence between the two theories, the motions of monochromatic 
light within isotropic or aeolotropic media of any degree of trans- 
parency or opacity, and at the boundary between two such media, 
being represented by equations absolutely identical, except that the 
symbols which denote displacement in one theory denote force in 
the other, and vice versâ.† In order to exhibit this correspondence 
completely and clearly, it is necessary that the fundamental principles 
of the two theories should be treated with the same generality, and, 
so far as possible, by the same method. The immediate consequences 
of the new theory will therefore be deduced with the same generality 
and essentially by the same method which has been used with 
reference to the electric theory in a former volume of this Journal 
[page 211 of this volume]. 

The elastic properties of the ether, according to the new theory, 
in its limiting case, may be very simply expressed by means of a 
vector operator, for which we shall use Maxwell’s designation. The 
curl of a vector is defined to be another vector so derived from the 
first that if $u$, $v$, $w$ be the rectangular components of the first, and 
$u'$, $v'$, $w'$ those of its curl, 

$$u' = \frac{dw}{dy} - \frac{dv}{dz}, \qquad v' = \frac{du}{dz} - \frac{dw}{dx}, \qquad w' = \frac{dv}{dx} - \frac{du}{dy}, \tag{1}$$

where $x$, $y$, $z$ are rectangular coordinates. With this understanding, 
if the displacement of the ether is represented by the vector $\mathfrak{E}$, the 
force exerted upon any element by the surrounding ether will be 

$$-B \operatorname{curl}\operatorname{curl} \mathfrak{E}\; dx\, dy\, dz, \tag{2}$$

where $B$ is a scalar (the so-called rigidity of the ether) having the 
same constant value throughout all space, whether ponderable matter 
is present or not. 

Where there is no ponderable matter, this force must be equated to 
the reaction of the inertia of the ether. This gives, with omission of 
the common factor $dx\,dy\,dz$, 

$$A\ddot{\mathfrak{E}} = -B \operatorname{curl}\operatorname{curl} \mathfrak{E}, \tag{3}$$

where $A$ denotes the density of the ether. 
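
The equation of motion for free ether can be checked on a plane wave with a few lines of arithmetic. The sketch below is an illustration only: it assumes a displacement proportional to $e^{\iota(gt + ax + by + cz)}$ with a real amplitude that is either purely transverse or purely longitudinal, so that the curl reduces to multiplication by $\iota$ times the cross product with the vector $(a, b, c)$. The function names and the numerical values of `A` and `B` are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def curl_plane_wave(k, amp):
    """Curl of amp * exp(i k.r): acts on the amplitude as i * (k x amp)."""
    return tuple(1j * c for c in cross(k, amp))

def curl_curl_plane_wave(k, amp):
    """Curl applied twice to the plane-wave amplitude."""
    return curl_plane_wave(k, curl_plane_wave(k, amp))

def velocity_squared(k, amp, A, B):
    """(g/|k|)**2 from A * (-g**2) * amp = -B * curl curl amp.

    Valid only when amp is purely transverse or purely longitudinal,
    so that curl curl amp is a multiple of amp (an assumption here).
    """
    cc = curl_curl_plane_wave(k, amp)
    k_sq = sum(c * c for c in k)
    multiple = sum(c * a for c, a in zip(cc, amp)) / sum(a * a for a in amp)
    return (B * multiple / (A * k_sq)).real

A, B = 2.0, 8.0          # hypothetical density and rigidity
k = (0, 0, 2)            # wave-normal along z
print(velocity_squared(k, (1, 0, 0), A, B))   # transverse displacement
print(velocity_squared(k, (0, 0, 1), A, B))   # longitudinal displacement
```

The transverse wave travels with the square of its velocity equal to $B/A$, while the longitudinal wave has zero velocity, which is the limiting case of the missing third wave in the new theory.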


* Sir William Thomson, loc. citat. R. T. Glazebrook, Phil. Mag., December, 1888. 

†In giving us a new interpretation of the equations of the electric theory, the author 
of the new theory has in fact enriched the mathematical theory of physics with something 
which may be compared to the celebrated principle of duality in geometry. 

The presence of ponderable matter disturbs the motions of the 
ether, and renders them too complicated for us to follow in detail. 
Nor is this necessary, for the quantities which occur in the equations 
of optics represent average values, taken over spaces large enough to 
smooth out the irregularities due to the ponderable particles, although 
very small as measured by a wave-length.* Now the general principles 
of harmonic motion† show that to maintain in any element of 
volume the motion represented by 

$$\mathfrak{E} = \mathfrak{A}\, e^{\iota \frac{2\pi}{p} t}, \tag{4}$$

$\mathfrak{A}$ being a complex vector constant, will require a force from outside 
represented by a complex linear vector function of $\ddot{\mathfrak{E}}$, that is, the 
three components of the force will be complex linear functions of the 
three components of $\ddot{\mathfrak{E}}$. We shall represent this force by 

$$B\Psi\ddot{\mathfrak{E}}\; dx\, dy\, dz, \tag{5}$$

where $\Psi$ represents a complex linear vector function.‡ 

If we now equate the force required to maintain the motion in any 
element to that exerted upon the element by the surrounding ether, 
we have the equation 

$$\Psi\ddot{\mathfrak{E}} = -\operatorname{curl}\operatorname{curl}\mathfrak{E}, \tag{6}$$


which expresses the general law for the motion of monochromatic 
light within any sensibly homogeneous medium, and may be regarded 
as implicitly including the conditions relating to the boundary of two 
such media, which are necessary for determining the intensities of 
reflected and refracted light. 

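For let $u$, $v$, $w$ be the components of $\mathfrak{E}$, 
$u'$, $v'$, $w'$ those of curl $\mathfrak{E}$, 
$u''$, $v''$, $w''$ those of curl curl $\mathfrak{E}$, 
so that 

$$u' = \frac{dw}{dy} - \frac{dv}{dz}, \qquad v' = \frac{du}{dz} - \frac{dw}{dx}, \qquad w' = \frac{dv}{dx} - \frac{du}{dy},$$

$$u'' = \frac{dw'}{dy} - \frac{dv'}{dz}, \qquad v'' = \frac{du'}{dz} - \frac{dw'}{dx}, \qquad w'' = \frac{dv'}{dx} - \frac{du'}{dy},$$

and let the interface be perpendicular to the axis of $Z$. It is evident 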


* This is in no respect different from what is always tacitly understood in the theory 
of sound, where the displacements, velocities, densities considered are always such 
average values. But in the theory of light, it is desirable to have the fact clearly in 
mind on account of the two interpenetrating media (imponderable and ponderable), the 
laws of light not being in all respects the same as they would be for a single homogeneous 
medium. 

†See Lord Rayleigh’s Theory of Sound, vol. i, chapters iv, v. 

‡It amounts essentially to the same thing, whether we regard the force as a linear 
vector function of $\mathfrak{E}$ or of $\ddot{\mathfrak{E}}$, since these differ only by the constant factor $-\frac{4\pi^2}{p^2}$. But 
there are some advantages in expressing the force as a function of $\ddot{\mathfrak{E}}$, because the 
greater part of the force, in the most important cases, is required to overcome the inertia 
of the ether, and is thus more immediately connected with $\ddot{\mathfrak{E}}$. 

that if $u'$ or $v'$ is discontinuous at the interface, the value of $u''$ or $v''$ 
becomes in a sense infinite, i.e., curl curl $\mathfrak{E}$, and therefore by (6) 
$\Psi\ddot{\mathfrak{E}}$, will be infinite. Now both $\mathfrak{E}$ and $\Psi$ are discontinuous at the 
interface, but infinite values for $\Psi\ddot{\mathfrak{E}}$ are not admissible. Therefore 
$u'$ and $v'$ are continuous. Again, if $u$ or $v$ is discontinuous, $u'$ or $v'$ 
will become infinite, and therefore $u''$ or $v''$. Therefore $u$ and $v$ are 
continuous. These conditions may be expressed in the most general 
manner by saying that the components of $\mathfrak{E}$ and curl $\mathfrak{E}$ parallel to the 
interface are continuous. This gives four complex scalar conditions, 
or in all eight scalar conditions, for the motion at the interface, which 
are sufficient to determine the amplitude and phase of the two 
reflected and the two refracted rays in the most general case. It is 
easy, however, to deduce from these four complex conditions, two 
others, which are interesting and sometimes convenient. It is evident 
from the definitions of $w'$ and $w''$ that if $u$, $v$, $u'$, and $v'$ are continuous 
at the interface, $w'$ and $w''$ will also be continuous. Now $-w''$ is equal 
to the component of $\Psi\ddot{\mathfrak{E}}$ normal to the interface. The following 
quantities are therefore continuous at the interface: 

the components parallel to the interface of $\mathfrak{E}$, 

the component normal to the interface of $\Psi\ddot{\mathfrak{E}}$, (7) 

all components of curl $\mathfrak{E}$. 


To compare these results with those derived from the electrical 
theory, we may take the general equation of monochromatic light on 
the electrical hypothesis from a paper in a former volume of this 
Journal. This equation, which with an unessential difference of notation 
may be written* 

$$-\operatorname{Pot}\ddot{\mathfrak{F}} - \nabla Q = 4\pi\Phi\mathfrak{F}, \tag{8}$$

was established by a method and considerations similar to those which 
have been used to establish equation (6), except that the ordinary law 
of electrodynamic induction had the place of the new law of elasticity. 
$\mathfrak{F}$ is a complex vector representing the electrical displacement as a 
harmonic function of the time; $\Phi$ is a complex linear vector operator, 
such that $4\pi\Phi\mathfrak{F}$ represents the electromotive force necessary to keep 
up the vibration $\mathfrak{F}$. $Q$ is a complex scalar representing the electrostatic 
potential, $\nabla Q$ the vector of which the three components are 

$$\frac{dQ}{dx}, \quad \frac{dQ}{dy}, \quad \frac{dQ}{dz}.$$

Pot denotes the operation by which in the theory of gravitation the 
potential is calculated from the density of matter.† When it is 


* See page 218 of this volume, equation (12). 

†The symbol $-\operatorname{Pot}$ is therefore equivalent to $4\pi\nabla^{-2}$, as used by Sir William Thomson 
(with a happy economy of symbols) at the last meeting of the British Association to express 
the same law of electrodynamic induction, except that the symbol is here used as a 
vector operator. See Nature, vol. xxxviii, p. 571, sub init. 

applied as here to a vector, the three components of the result are to 
be calculated separately from the three components of the operand. 
$-\nabla Q$ is therefore the electrostatic force, and $-\operatorname{Pot}\ddot{\mathfrak{F}}$ the electrodynamic 
force. In establishing the equation, it was not assumed that 
the electrical motions are solenoidal, or such as to satisfy the so-called 
“equation of continuity.” We may now, however, make this assump- 
tion, since it is the extreme case of the electric theory which we are 
to compare with the extreme case of the elastic. 

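It results from the definitions of curl and $\nabla$ that curl $\nabla Q = 0$. We 
may therefore eliminate $Q$ from equation (8) by taking the curl. 
This gives 

$$-\operatorname{curl}\operatorname{Pot}\ddot{\mathfrak{F}} = 4\pi \operatorname{curl}\Phi\mathfrak{F}. \tag{9}$$

Since curl curl and $\frac{1}{4\pi}$ Pot are inverse operators for solenoidal vectors, 
we may get rid of the symbol Pot by taking the curl again. We 
thus get 

$$-\ddot{\mathfrak{F}} = \operatorname{curl}\operatorname{curl}\Phi\mathfrak{F}. \tag{10}$$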
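
The inverse relation used in this step may be spelled out. The lines below are a sketch, not part of the original argument: $\mathfrak{X}$ is a hypothetical name for any solenoidal vector in unbounded space, and the second line is Poisson's equation applied to each component of $\operatorname{Pot}\mathfrak{X}$.

```latex
\operatorname{curl}\operatorname{curl}\mathfrak{X}
  = \nabla(\nabla\cdot\mathfrak{X}) - \nabla^2\mathfrak{X}
  = -\nabla^2\mathfrak{X},
\qquad
\nabla^2 \operatorname{Pot}\mathfrak{X} = -4\pi\,\mathfrak{X},
\qquad\text{hence}\qquad
\operatorname{curl}\operatorname{curl}\,\tfrac{1}{4\pi}\operatorname{Pot}\mathfrak{X} = \mathfrak{X}.
```

Taking the curl of $-\operatorname{curl}\operatorname{Pot}\ddot{\mathfrak{F}} = 4\pi\operatorname{curl}\Phi\mathfrak{F}$ therefore gives $-4\pi\ddot{\mathfrak{F}} = 4\pi\operatorname{curl}\operatorname{curl}\Phi\mathfrak{F}$, and the factor $4\pi$ divides out.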

The conditions for the motion at the boundary between different 
media are easily obtained from the following considerations. Pot $\ddot{\mathfrak{F}}$ 
and $Q$ are evidently continuous at the interface. Therefore the 
components parallel to the interface of $\nabla Q$, and by (8) of $\Phi\mathfrak{F}$, will be 
continuous. Again, curl Pot $\ddot{\mathfrak{F}}$ is continuous at the interface, as 
appears from the consideration that curl Pot $\dot{\mathfrak{F}}$ is the magnetic force 
due to the electrical motions $\mathfrak{F}$. Therefore, by (9), curl $\Phi\mathfrak{F}$ is continuous. 
The solenoidal condition requires that the component of $\mathfrak{F}$ 
normal to the interface shall be continuous. 

The following quantities are therefore continuous at the interface: 

the component normal to the interface of $\mathfrak{F}$, 

the components parallel to the interface of $\Phi\mathfrak{F}$, (11) 

all components of curl $\Phi\mathfrak{F}$. 

Of these conditions, the two relating to the normal components of $\mathfrak{F}$ 
and curl $\Phi\mathfrak{F}$ are easily shown to result from the other four conditions, 
as in the analogous case in the elastic theory. 


If we now compare in the two theories the differential equations of 
the motion of monochromatic light for the interior of a sensibly 
homogeneous medium, (6) and (10), and the special conditions for the 
boundary between two such media as represented by the continuity of 
the quantities (7) and (11), we find that these equations and conditions 
become identical, if 

$$\ddot{\mathfrak{F}} = \Psi\ddot{\mathfrak{E}}, \tag{12}$$

$$\mathfrak{E} = \Phi\mathfrak{F}, \tag{13}$$

$$\Psi = \Phi^{-1}. \tag{14}$$

In other words, the displacements in either theory are subject to the 
same general and surface conditions as the forces required to maintain 
the vibrations in an element of volume in the other theory. 

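To fix our ideas in regard to the signification of $\Psi$ and $\Phi$, we may 
consider the case of isotropic media, in which these operators reduce 
to ordinary algebraic quantities, simple or complex. Now the curl 
of any vector necessarily satisfies the solenoidal condition (the 
so-called “equation of continuity”), therefore by (6) $\Psi\ddot{\mathfrak{E}}$ and $\mathfrak{E}$ will 
be solenoidal. So also will $\ddot{\mathfrak{F}}$ and $\mathfrak{F}$ in the electrical theory. Now 
for solenoidal vectors 

$$-\operatorname{curl}\operatorname{curl} = \frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}, \tag{15}$$

so that the equations (6) and (10) reduce to 

$$\Psi\ddot{\mathfrak{E}} = \left(\frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}\right)\mathfrak{E}, \tag{16}$$

$$\ddot{\mathfrak{F}} = \left(\frac{d^2}{dx^2} + \frac{d^2}{dy^2} + \frac{d^2}{dz^2}\right)\Phi\mathfrak{F}. \tag{17}$$

For a simple train of waves, the displacement, in either theory, may 
be represented by a constant multiplied by 

$$e^{\iota(gt + ax + by + cz)}. \tag{18}$$

Our equations then reduce again to 

$$g^2\Psi\mathfrak{E} = (a^2 + b^2 + c^2)\,\mathfrak{E}, \tag{19}$$

$$g^2\mathfrak{F} = (a^2 + b^2 + c^2)\,\Phi\mathfrak{F}. \tag{20}$$

Hence 

$$\Psi^{-1} = \Phi = \frac{g^2}{a^2 + b^2 + c^2}. \tag{21}$$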

The last member of this equation, when real, evidently expresses the 
square of the velocity of light. If we set 

$$n^2 = \frac{k^2(a^2 + b^2 + c^2)}{g^2}, \tag{22}$$

$k$ denoting the velocity of light in vacuo, we have 

$$n^2 = k^2\Psi = k^2\Phi^{-1}. \tag{23}$$

When $n^2$ is positive, which is the case of perfectly transparent 
bodies, the positive root of $n^2$ is called the index of refraction of the 
medium. In the most general case, it would be appropriate to call 
$n$—or perhaps that root of $n^2$ of which the real part is positive—the 
(complex) index of refraction, although the terminology is hardly 
settled in this respect. A negative value of $n^2$ would represent a 
body from which light would be totally reflected at all angles of 
incidence. No such cases have been observed. Values of $n^2$ in which 
the coefficient of $\iota$ is negative, indicate media in which light is 
absorbed. Values in which the coefficient of $\iota$ is positive would 
represent media in which the opposite phenomenon took place.* 
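
The cases distinguished above can be tabulated with a short sketch. It assumes the sign convention of the text, in which a negative coefficient of $\iota$ in $n^2$ indicates absorption. The function names and the label "amplifying" for the opposite phenomenon are hypothetical, and the sample values of $n^2$ are illustrative only.

```python
import cmath

def complex_index(n_squared):
    """Root of n**2 whose real part is not negative.

    cmath.sqrt returns the principal root, which lies in the right
    half-plane, so no further choice of sign is needed.
    """
    return cmath.sqrt(n_squared)

def classify(n_squared, tol=1e-12):
    """Name the case to which a given value of n**2 belongs."""
    z = complex(n_squared)
    if abs(z.imag) <= tol:
        return "transparent" if z.real > 0 else "totally reflecting"
    return "absorbing" if z.imag < 0 else "amplifying (not observed)"

# Illustrative values of n**2, not taken from any measurement.
for value in (2.25, -1.0, 2.0 - 0.3j):
    print(value, complex_index(value), classify(value))
```

For a real positive $n^2$ the function returns the ordinary index of refraction; for the other cases it returns the complex root whose real part is not negative.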

It is no part of the object of this paper to go into the details 
by which we may derive, so far as observable phenomena are con- 
cerned, Fresnel’s law of double refraction for transparent bodies, 
as well as the more general law of the same character which relates to 
aeolotropic bodies of more or less opacity, and which differs from 
Fresnel’s only in that certain quantities become complex, or Fresnel’s 
laws for the intensities of reflected and refracted light at the boun- 
dary of transparent isotropic media, with the more general laws for 
the case of bodies aeolotropic or opaque or both. The principal cases 
have already been discussed on the new elastic theory in the 
Philosophical Magazine† and a further discussion is promised. For 
the electrical theory, the case of double refraction in perfectly transparent 
media has been discussed quite in detail in this Journal,‡ and 
the intensities of reflected and refracted light have been abundantly 
deduced from the above conditions by various authors.§ So far as all 
these laws are concerned, the object of this paper will be attained 
if it has been made clear that the two theories, in their extreme cases, 
give identical results. The greater or less degree of elegance, or 
completeness, or perspicuity, with which these laws may be developed 
by different authors, should weigh nothing in favor of either theory. 

The non-magnetic rotation of the plane of polarization, with the 
allied phenomena in aeolotropic bodies, lie in a certain sense outside 
of the above laws, as depending on minute quantities which have been 
neglected in this discussion. The manner in which these minute 
quantities affect the equations of motion on the electrical theory has 
been shown in a former paper,‖ where these phenomena in transparent 
bodies are treated quite at length. For the new theory, a 
discussion of this subject is promised by Mr. Glazebrook. 

But the magnetic rotation of the plane of polarization, with the 
allied phenomena when an aeolotropic body is subjected to magnetic 
influence, fall entirely within the scope of the above equations and 
surface-conditions. The characteristic of this case is that $\Psi$ and $\Phi$ 
are not self-conjugate. This is what we might expect on the electric 


*But $\iota$ might have been introduced into the equations in such a way that a positive 
coefficient in the value of $n^2$ would indicate absorption, and a negative coefficient the 
impossible case. 

†Sir William Thomson, loc. citat. R. T. Glazebrook, loc. citat. 

‡This vol. p. 182. 

§Lorentz, Schlömilch’s Zeitschrift, vol. xxii, pp. 1-30 and 205-219; vol. xxiii, 
pp. 197-210; Fitzgerald, Phil. Trans., vol. clxxi, p. 691; J. J. Thomson, Phil. 
Mag. (5), vol. ix, p. 284; Rayleigh, Phil. Mag. (5), vol. xii, p. 81. Glazebrook, Proc. 
Cambr. Phil. Soc., vol. iv, p. 155. 

‖This vol. p. 195. 

¶See this vol. p. 217. 

theory from the experiments of Dr. Hall, which show that the 
operators expressing the relation between electromotive force and 
current are not in general self-conjugate in this case. 


In the preceding comparison, we have considered only the limiting 
cases of the two theories. With respect to the sense in which the 
limiting case is admissible, the two theories do not stand on quite 
the same footing. In the electric theory, or in any in which the 
velocity of the missing wave is very great, if we are satisfied that 
the compressibility is so small as to produce no appreciable results, 
we may set it equal to zero in our mathematical theory, even if we 
do not regard this as expressing the actual facts with absolute 
accuracy. But the case is not so simple with an elastic theory in 
which the forces resisting certain kinds of motion vanish, so far 
at least as they are proportional to the strains. The first requisite 
for any sort of optical theory is that the forces shall be proportional 
to the displacements. This is easily obtained in general by supposing 
the displacements very small. But if the resistance to one kind of 
distortion vanishes, there will be a tendency for this kind of distortion 
to appear at some places in an exaggerated form, and even to an 
infinite degree, however small the displacements may be in other 
parts of the field. In the case before us, if we suppose the velocity 
of the missing wave to be absolutely zero, there will be infinite 
condensations and rarefactions at a surface where ordinary waves 
are reflected. That is, a certain volume of ether will be condensed 
to a surface, and vice versâ. This prevents any treatment of the 
extreme case, which is at once simple and satisfactory. The difficulty 
has been noticed by Sir William Thomson, who observes that it 
may be avoided if we suppose the displacements infinitely small in 
comparison with the wave-length of the wave of compression. This 
implies a finite velocity for that wave. A similar difficulty would 
probably be found to exist (in the extreme case) with regard to 
the deformation of the ether by the molecules of ponderable matter, 
as the ether oscillates among them. If the statical resistance to 
irrotational motions is zero, it is not at all evident that the statical 
forces evoked by the disturbance caused by the molecules would be 
proportional to the motions. But this difficulty would be obviated by 
the same hypothesis as the first. 

These circumstances render the elastic theory somewhat less convenient 
as a working hypothesis than the electric. They do not 
necessarily involve any complication of the equations of optics. For 
it may still be possible that this velocity of the missing wave is 
so small that the quantities on which it depends may be set equal 
to zero in the equations which represent the phenomena of optics. 
But the mental processes by which we satisfy ourselves of the 
validity of our results (if we do not work out the whole problem 
in the general case of no assumption in regard to the velocity of 
the missing wave) certainly involve conceptions of a higher degree 
of difficulty on account of the circumstances mentioned. Perhaps 
this ought not to affect our judgment with respect to the question 
of the truth of the hypothesis. 

Although the two theories give laws of exactly the same form 
for monochromatic light in the limiting case, their deviations from 
this limit are in opposite directions, so that if the phenomena of 
optics differed in any marked degree from what we would have in 
the limiting case, it would be easy to find an experimentum crucis 
to decide between the two theories. A little consideration will make 
it evident, that when the principal indices of refraction of a crystal are 
given, the intermediate values for oblique wave-planes will be less if 
the velocity of the missing wave is small but finite, than if it is infini- 
tesimal, and will be greater if the velocity of the missing wave is very 
great than if it is infinite.* Hence, if the velocity of the missing 
wave is small but finite, the intermediate values of the indices of 
refraction will be less than are given by Fresnel’s law, but if the 
velocity of the missing wave is very great but finite, the intermediate 
values of the indices of refraction will be greater than are given by 
Fresnel’s law. But the recent experiments of Professor Hastings on 
the law of double refraction in Iceland spar do not encourage us to 
look in this direction for the decision of the question.†


In a simple train of waves in a transparent medium, the potential 
energy, on the elastic theory, may be divided into two parts, of which 
one is due to that general deformation of the ether which is repre- 
sented by the equations of wave-motion, and the other to those 
deformations which are caused by the interference of the ponderable 
particles with the wave-motion, and to such displacements of the 
ponderable matter as may be caused, in some cases at least, by the 
motion of the ether. If we write h for the amplitude, l for the wave-length,
and p for the period, these two parts of the statical energy
(estimated per unit of volume for a space including many wave-lengths)
may be represented respectively by

$$\frac{\pi^2 B h^2}{l^2} \quad\text{and}\quad \frac{b h^2}{4}.$$

* This may be more clear if we consider the stationary waves formed by two trains of
waves moving in opposite directions. The case then comes under the following theorem:
“If the system undergo such a change that the potential energy of a given configuration
is diminished, while the kinetic energy of a given motion is unaltered, the periods
of the free vibrations are all increased, and conversely.” See Lord Rayleigh’s Theory of
Sound, vol. i, p. 85.

† Am. Jour. Sci., ser. 3, vol. xxxv, p. 60.


The sum of these may be equated to the kinetic energy, giving an
equation of the form

$$\frac{\pi^2 B h^2}{l^2} + \frac{b h^2}{4} = \frac{\pi^2 A' h^2}{p^2}. \tag{24}$$

B is an absolute constant (the rigidity of the ether, previously
represented by the same letter), A’ and b will be constant (for the
same medium and the same direction of the wave-normal) except
so far as the type of the motion changes, i.e., except so far as the
manner in which the motion of the ether distributes itself between 
the ponderable molecules, and the degree in which these take part 
in the motion, may undergo a change. When the period of vibration 
varies, the type of motion will vary more or less, and A’ and b will 
vary more or less. 

In a manner entirely analogous,* the kinetic energy, on the elec- 
trical theory, may be divided into two parts, of which one is due to 
those general fluxes which are represented by the equations of wave- 
motions, and the other to those irregularities in the fluxes which 
are caused by the presence of the ponderable molecules, as well as to 
such motions of the ponderable particles themselves as may some- 
times occur. These parts of the kinetic energy may be represented 
respectively by

$$\frac{\pi F l^2 h^2}{p^2} \quad\text{and}\quad \frac{\pi^2 f h^2}{p^2}.$$

Their sum equated to the potential energy gives

$$\frac{\pi F l^2 h^2}{p^2} + \frac{\pi^2 f h^2}{p^2} = \frac{G h^2}{4}. \tag{25}$$

Here F is the constant of electrodynamic induction, which is unity if
we use the electromagnetic system of units; f and G (like A’ and b)
vary only so far as the type of motion varies.

We have the means of forming a very exact numerical estimate 
of the ratio of the two parts into which the statical energy is thus 
divided on the elastic theory, or the kinetic energy on the electric 
theory. The means for this estimate is afforded by the principle 
that the period of a natural vibration is stationary when its type is 
infinitesimally altered by any constraint.† Let us consider a case of
simple wave-motion, and suppose the period to be infinitesimally 
varied: the wave-length will also vary, and presumably to some 
extent the type of vibration. But, by the principle just stated, if the 
ether or the electricity could be constrained to vibrate in the original
type, the variations of l and p would be the same as in the actual


* See page 182 of this volume.
† See Lord Rayleigh’s Theory of Sound, vol. i, p. 84. The application of the principle
is most simple in the case of stationary waves.

case. Therefore, in finding the differential equation between l and p,
we may treat b and A’ in (24) and f and G in (25) as constant, as well 
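as B and F. These equations may be written

$$4\pi^2 B\,\frac{p^2}{l^2} + b\,p^2 = 4\pi^2 A',$$

$$\pi F\,\frac{l^2}{p^2} + \frac{\pi^2 f}{p^2} = \frac{G}{4}.$$

Differentiating, we get

$$4\pi^2 B\, d\!\left(\frac{p^2}{l^2}\right) = -b\, d(p^2),$$

$$\pi F\, d\!\left(\frac{l^2}{p^2}\right) = -\pi^2 f\, d(p^{-2});$$

or

$$4\pi^2 B\,\frac{p^2}{l^2}\, d\log\frac{p^2}{l^2} = -b\,p^2\, d\log p^2,$$

$$\pi F\,\frac{l^2}{p^2}\, d\log\frac{l^2}{p^2} = -\frac{\pi^2 f}{p^2}\, d\log p^{-2}.$$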

Hence, if we write V for the wave-velocity (l/p), n for the index of
refraction, and λ for the wave-length in vacuo, we have for the ratio
of the two parts into which we have divided the potential energy on
the elastic theory,

$$\frac{b h^2}{4} : \frac{\pi^2 B h^2}{l^2} = \frac{d\log V}{d\log p} = -\frac{d\log n}{d\log\lambda}, \tag{26}$$

and for the ratio of the two parts into which we have divided the
kinetic energy on the electrical theory,

$$\frac{\pi^2 f h^2}{p^2} : \frac{\pi F l^2 h^2}{p^2} = \frac{d\log V}{d\log p} = -\frac{d\log n}{d\log\lambda}. \tag{27}$$


It is interesting to see that these ratios have the same value. This
value may be expressed in another form, which is suggestive of some
important relations. If we write U for what Lord Rayleigh has
called the velocity of a group of waves,*

$$\frac{U}{V} = 1 - \frac{d\log V}{d\log l}, \qquad \frac{d\log V}{d\log l} = \frac{V-U}{V}, \qquad \frac{d\log V}{d\log p} = \frac{V-U}{U}. \tag{28}$$

It appears, therefore, that in the elastic theory that part of the
potential energy which depends on the deformation expressed


* See his “Note on Progressive Waves,” Proc. Lond. Math. Soc., vol. ix, No. 125,
reprinted in his Theory of Sound, vol. ii, p. 297.

by the equations of wave-motion, bears to the whole potential
energy the same ratio which the velocity of a group of waves 
bears to the wave-velocity. In the electrical theory, that part of 
the kinetic energy which depends on the motions expressed by the 
equations of wave-motion bears to the whole kinetic energy the same 
ratio. 
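
The relation between the ratios (26), (27) and the velocities in (28) can be checked numerically. The following is a minimal sketch and not part of the original paper: it assumes a hypothetical Cauchy-type dispersion formula n = a + c/λ² with arbitrary illustrative constants, evaluates −d log n/d log λ by a central difference, and converts the result to U/V by means of (28).

```python
import math

def index(lam, a=1.5, c=0.005):
    """Hypothetical dispersion formula n = a + c / lam**2 (illustrative constants)."""
    return a + c / lam ** 2

def energy_ratio(lam, rel_step=1e-6):
    """Last member of (26) and (27): -d log n / d log lambda, by a central difference."""
    d_log_n = math.log(index(lam * (1 + rel_step))) - math.log(index(lam * (1 - rel_step)))
    d_log_lam = math.log(1 + rel_step) - math.log(1 - rel_step)
    return -d_log_n / d_log_lam

def group_over_wave(lam):
    """U/V from (28): the ratio equals (V - U)/U, so that U/V = 1 / (1 + ratio)."""
    return 1.0 / (1.0 + energy_ratio(lam))

for lam in (0.4, 0.5, 0.7):
    print(f"lambda = {lam}: ratio = {energy_ratio(lam):.5f}, U/V = {group_over_wave(lam):.5f}")
```

With any formula of this kind the ratio comes out small compared with unity and diminishes as the wave-length increases, which is the behavior appealed to in the discussion of (26) and (27).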

Returning to the consideration of equations (26) and (27), we 
observe that in transparent bodies the last member of these equations 
represents a quantity which is small compared with unity, at least in 
the visible spectrum, and diminishes rapidly as the wave-length 
increases. This is just what we should expect of the first member of 
equation (27). But when we pass to equation (26), which relates to 
the elastic theory, the case is entirely different. The fact that the 
kinetic energy is affected by the presence of the ponderable matter 
and affected differently in different directions, shows that the motion 
of the ether is considerably modified. This implies a distortion superposed
upon the distortion represented by the equations of wave-motion,
and very much greater, since the body is very fine-grained as
measured by a wave-length. With any other law of elasticity, we 
should suppose that the energy of this superposed distortion would 
enormously exceed that of the regular distortion represented by the 
equations of wave-motion. But it is the peculiarity of this new law 
of elasticity that there is one kind of distortion, of which the energy 
is very small, and which is therefore peculiarly likely to occur. Now 
if we can suppose the distortion caused by the ponderable molecules 
to be almost entirely of this kind, we may be able to account for the 
smallness of its energy. We should still expect the first member of 
(26) to increase with the wave-length, on account of the factor $l^2$,
instead of diminishing, as the last member of the equation shows that 
it does. We are obliged to suppose that b, and therefore the type of 
the vibrations, varies very rapidly with the wave-length, even in those 
cases which appear farthest removed from anything like selective 
absorption. 


The electrical theory furnishes a relation between the refractive 
power of a body and its specific dielectric capacity, which is com- 
monly expressed by saying that the latter is equal to the square of 
the index of refraction for waves of infinite length. No objection can
be made to this statement, but the great uncertainty in determining
the index for waves of infinite length by extrapolation prevents it 
from furnishing any very rigorous test of the theory. Yet, as the 
results of extrapolation in some cases agree strikingly with the 
specific dielectric capacity, although in other cases they are quite 
different, the correspondence is generally regarded as corroborative, in 
some degree, of the theory. But the relation between refractive power 
and dielectric capacity may be expressed in a form which will furnish
a more rigorous test, as not involving extrapolation. 

We have seen on page 242 how we may determine numerically the 
ratio of the two first terms of equation (25). We thus easily get 
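the ratio of the first and last term, which gives

$$\frac{G h^2}{4} = \frac{d\log l}{d\log\lambda}\,\frac{\pi F l^2 h^2}{p^2}. \tag{29}$$

In the corresponding equation for a train of waves of the same amplitude
and period in vacuo, l becomes λ, F remains the same, and for G
we may write G’. This gives

$$\frac{G' h^2}{4} = \frac{\pi F \lambda^2 h^2}{p^2}. \tag{30}$$

Dividing, we get

$$\frac{G}{G'} = \frac{d\log l}{d\log\lambda}\,\frac{l^2}{\lambda^2} = \frac{d(l^2)}{d(\lambda^2)}. \tag{31}$$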

Now G’ is the dielectric elasticity of pure ether. If K is the
specific dielectric capacity of the body which we are considering,
G’/K is the dielectric elasticity of the body and G’/2K is the potential
energy of the body (per unit of volume), due to a unit of ordinary
electrostatic displacement. But $Gh^2/4$ is the potential energy in a
train of waves of amplitude h. Since the average square of the displacement
is $h^2/2$, the potential energy of a unit displacement such as
occurs in a train of waves is G/2. Now in the electrostatic experiment
the displacement distributes itself among the molecules so as to
make the energy a minimum. But in the case of light the distribution
of the displacement is not determined entirely by statical
considerations. Hence

$$\frac{G}{2} \geq \frac{G'}{2K}, \tag{32}$$

$$K \geq \frac{G'}{G},$$

and

$$K \geq \frac{d(\lambda^2)}{d(l^2)}. \tag{33}$$

It is to be observed that if we should assume for a dispersion-formula

$$n^{-2} = a - b\lambda^{-2}, \tag{34}$$

1/a, which is the square of the index of refraction for an infinite
wave-length, would be identical with the second member of (33).
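
The remark on (34) follows because l = λ/n, so that l² = λ²n⁻² = aλ² − b, which is linear in λ². A minimal numerical sketch of this check is given here; it is not from the original, and the constants a and b are arbitrary illustrative values.

```python
def l_squared(lam, a=0.42, b=0.003):
    """Square of the wave-length in the body, l^2 = lam^2 * n^-2, with n^-2 = a - b / lam^2."""
    n_inv_sq = a - b / lam ** 2
    return lam ** 2 * n_inv_sq

def second_member_of_33(lam, a=0.42, b=0.003, h=1e-4):
    """d(lambda^2) / d(l^2), estimated by a central difference in lambda."""
    d_lam_sq = (lam + h) ** 2 - (lam - h) ** 2
    d_l_sq = l_squared(lam + h, a, b) - l_squared(lam - h, a, b)
    return d_lam_sq / d_l_sq

for lam in (0.4, 0.5, 0.7):
    # The same value, 1/a, is obtained at every wave-length.
    print(lam, second_member_of_33(lam), 1 / 0.42)
```

Under the assumed formula the bound furnished by (33) is therefore independent of the wave-length at which it is evaluated.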

Another similarity between the electrical and optical properties of 
bodies consists in the relation between conductivity and opacity. 
Bodies in which electrical fluxes are attended with absorption of
energy absorb likewise the energy of the motions which constitute
light. This is strikingly true of the metals. But the analogy does
not stop here. To fix our ideas, let us consider the case of an 
isotropic body and circularly polarized light, which is geometrically 
the simplest case although its analytical expression is not so simple 
as that of plane-polarized light. The displacement at any point 
may be symbolized by the rotation of a point in a circle. The 
external force necessary to maintain the displacement $\mathfrak{F}$ is represented
by $n^{-2}\mathfrak{F}$. In transparent bodies, for which $n^{-2}$ is a positive number,
the force is radial and in the direction of the displacement, being
principally employed in counterbalancing the dielectric elasticity,
which tends to diminish the displacement. In a conductor $n^{-2}$
becomes complex, which indicates a component of the force in the
direction of $\dot{\mathfrak{F}}$, that is, tangential to the circle. This is only the
analytical expression of the fact above mentioned. But there is
another optical peculiarity of metals, which has caused much remark,
viz., that the real part of $n^2$ (and therefore of $n^{-2}$) is negative, i.e.,
the radial component of the force is directed towards the center.
This inwardly directed force, which evidently opposes the electro- 
dynamic induction of the irregular part of the motion, is small 
compared with the outward force which is found in transparent 
bodies, but increases rapidly as the period diminishes. We may say,
therefore, that metals exhibit a second optical peculiarity, that the
dielectric elasticity is not prominent as in transparent bodies. This
is like the electrical behavior of the metals, in which we do not
observe any elastic resistance to the motion of electricity. We see,
therefore, that the complex indices of metals, both in the real and
the imaginary part of their inverse squares, exhibit properties
corresponding to the electrical behavior of the metals.

The case is quite different in the elastic theory. Here the force
from outside necessary to maintain in any element of volume the
displacement $\mathfrak{E}$ is represented by $-n^2\mathfrak{E}$. In transparent bodies,
therefore, it is directed toward the center. In metals, there is a
component in the direction of the motion $\dot{\mathfrak{E}}$, while the radial part of
the force changes its direction and is often many times greater than
the opposite force in transparent bodies. This indicates that in
metals the displacement of the ether is resisted by a strong elastic
force, quite enormous compared to anything of the kind in transparent
bodies, where it indeed exists, but is so small that it has been
neglected by most writers except when treating of dispersion. We
can make these suppositions, but they do not correspond to
anything which we know independently of optical experiment.


It is evident that the electrical theory of light has a serious rival,
in a sense in which, perhaps, one did not exist before the publication
of Sir William Thomson’s paper in November last.* Nevertheless,
neither surprise at the results which have been achieved, nor 
admiration for that happy audacity of genius, which, seeking the 
solution of the problem precisely where no one else would have 
ventured to look for it, has turned half a century of defeat into 
victory, should blind us to the actual state of the question. 


It may still be said for the electrical theory, that it is not obliged
to invent hypotheses,† but only to apply the laws furnished by
the science of electricity, and that it is difficult to account for the 
coincidences between the electrical and optical properties of media, 
unless we regard the motions of light as electrical. But if the 
electrical character of light is conceded, the optical problem is very 
different from anything which existed in the time of Fresnel, Cauchy, 
and Green. The third wave, for example, is no longer something 
to be gotten rid of quocunque modo, but something which we must
dispose of in accordance with the laws of electricity. This would 
seem to rule out the possibility of a relatively small velocity for 
the third wave. 


* “Since the first publication of Cauchy’s work on the subject in 1830, and of Green’s
in 1837, many attempts have been made by many workers to find a dynamical
foundation for Fresnel’s laws of reflexion and refraction of light, but all hitherto
ineffectually.” Sir William Thomson, loc. citat.

“So far as I am aware, the electric theory of Maxwell is the only one satisfying these
conditions (of explaining at once Fresnel’s laws of double refraction in crystals and
those governing the intensity of reflexion when light passes from one isotropic medium
to another).” Lord Rayleigh, Phil. Mag., September, 1888.

† Electrical motions in air, since the recent experiments of Professor Hertz, seem to
be no longer a matter of hypothesis. We can hardly suppose that the case is essentially
different with the so-called vacuum. The theorem that the electrical motions of light
are solenoidal, although it is convenient to assume it as a hypothesis and show that the
results agree with experiment, need not occupy any such fundamental position in the
theory. It is in fact only another way of saying that two of the constants of electrical
science have a certain ratio (infinity). It would be easy to commence without assuming
this value, and to show in the course of the development of the subject that experiment
requires it, not of course as an abstract proposition, but in the sense in which
experiment can be said to require any values of any constants, that is, to a certain
degree of approximation.




XVI. 


REVIEWS OF NEWCOMB AND MICHELSON’S “VELOCITY
OF LIGHT IN AIR AND REFRACTING MEDIA” AND
OF KETTELER’S “THEORETISCHE OPTIK.”


[American Journal of Science, ser. 3, vol. XXXI. pp. 62-67, Jan. 1886.]


Velocity of Light in Air and Refracting Media. 


Astronomical Papers prepared for the use of the American Ephemeris and
Nautical Almanac, vol. II. parts 3 and 4,* Washington, 1885.


PROFESSOR NEWCOMB obtains as the final result of his experiments
at Washington 299,860 ± 30 kilometers per second for the velocity of
light in vacuo. Professor MICHELSON’s entirely independent experiments
at Cleveland give substantially the same result (299,853 ± 60).
His former experiments at the Naval Academy, after correction of
two small errors which he now reports, give 299,910 ± 50. All these
experiments were made with the revolving mirror, but the arrangements
of the two experimenters were in other respects radically
different. The first of these values of the velocity of light with
Nyrén’s value of the constant of aberration (20″.492) gives 149.60 for
the distance of the sun in millions of kilometers. On account of the
recent announcement by Messrs. Young and Forbes of a difference 
of about two per cent. in the velocities of red and blue light, 
especial attention was paid to this point by both experimenters, 
without finding the least indication of any difference. In Professor 
Newcomb’s experiments, a difference of only one thousandth in these 
velocities would have produced a well-marked iridescence on the 
edges of the return image of the slit formed by reflection from the 
revolving mirror. No trace of such iridescence could ever be seen. 
Professor Michelson made an experiment, in which a red glass covered 
one-half the slit. The two halves of the image—the upper white, 
the lower red—were exactly in line. 
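
The distance of the sun quoted in this paragraph can be reproduced approximately from the two numbers given. The following sketch is not part of the original review: it assumes the standard relation κ = 2πa / (cT√(1 − e²)) between the constant of aberration κ, the mean distance a, and the velocity of light c, and it supplies an assumed length of the sidereal year T and an assumed eccentricity e of the earth's orbit, neither of which is stated in the text.

```python
import math

ARCSEC_PER_RADIAN = 180 * 3600 / math.pi

def sun_distance_km(c_km_s, aberration_arcsec,
                    sidereal_year_s=365.25636 * 86400,  # assumed value, not from the text
                    eccentricity=0.01675):              # assumed value, not from the text
    """Mean distance of the sun from the velocity of light and the constant of aberration."""
    kappa = aberration_arcsec / ARCSEC_PER_RADIAN
    return (c_km_s * kappa * sidereal_year_s
            * math.sqrt(1 - eccentricity ** 2) / (2 * math.pi))

distance = sun_distance_km(299860, 20.492)
print(f"{distance / 1e6:.2f} millions of kilometers")
```

With these assumptions the result agrees with the figure in the text to within about one unit in the last place printed; the small residual depends on the constants adopted for the year and the eccentricity.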

Since Maxwell’s electromagnetic theory of light makes the velocity 
of light in air equal to the ratio of the electromagnetic and 


* [Part 3, “Measures of the Velocity of Light,” S. Newcomb; part 4, “Supplementary
Measures of the Velocities of white and colored Light in air, water,
and carbon disulphide,” A. A. Michelson.]

electrostatic units of electricity, it will be interesting to compare
some recent determinations of this ratio. These we give in the 
following table. Since the determinations are affected by any error 
in the standard of resistance, we have corrected the results, first, on 
the supposition that the B.A. ohm = .987 true ohms (Lord Rayleigh’s
result), and secondly, on the supposition that the B.A. ohm = .989 true
ohms, which is essentially assuming that the legal ohm represents the 
true value. 


Ratio of Electromagnetic and Electrostatic units of Electricity in millions of
meters and seconds.


                      Date.   As published.   B.A. ohm = .987.   B.A. ohm = .989.
Ayrton & Perry,*      1878    298.0           296.1              296.4
Hockin,†              1879    298.8           296.9              297.2
Shida,‡               1880    299.5           295.6              296.2
Exner,§               1882    301.1 (?)       291.7 (?)          292.3 (?)
J. J. Thomson,‖       1883    296.3           296.3              296.9
Klemenčič,¶           1884    301.88 (?)      301.88 (?)         302.48 (?)


These numbers are to be compared with the velocity of light in air, in
millions of meters per second, for which Professor Newcomb gives
299.778. Of the electrical determinations, that of J. J. Thomson
appears by far the most worthy of confidence. That of Klemenčič
(the only one as great as the velocity of light) was obtained by the
use of a condenser with glass, a method which would presumably
give too great a ratio. Exner’s value is obtained from the mean of
three determinations, one of which differed from the others by about
three per cent. If we reject this discordant determination, the mean
of the other two would give when corrected for resistance 294.4 and
295.0. If we set aside the determinations of Exner and Klemenčič,
the remaining four, which represent three different methods, are very
accordant, the mean being nearly identical with the result of J. J.
Thomson, and about one per cent. less than the velocity of light.
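
The closing statement about the mean of the four remaining determinations can be verified directly from the corrected columns of the table. The sketch below is an illustration added for the reader; the numbers are copied from the table and from Newcomb's value quoted above, and nothing else is assumed.

```python
# (observer, value with B.A. ohm = .987, value with B.A. ohm = .989),
# in millions of meters per second, copied from the table above.
determinations = [
    ("Ayrton & Perry", 296.1, 296.4),
    ("Hockin", 296.9, 297.2),
    ("Shida", 295.6, 296.2),
    ("J. J. Thomson", 296.3, 296.9),
]
LIGHT_IN_AIR = 299.778  # Newcomb, millions of meters per second

def mean(values):
    values = list(values)
    return sum(values) / len(values)

mean_987 = mean(row[1] for row in determinations)
mean_989 = mean(row[2] for row in determinations)

for label, m in (("B.A. ohm = .987", mean_987), ("B.A. ohm = .989", mean_989)):
    deficit = 100 * (LIGHT_IN_AIR - m) / LIGHT_IN_AIR
    print(f"{label}: mean {m:.2f}, {deficit:.2f} per cent below the velocity of light")
```

Both means fall within a few tenths of J. J. Thomson's corrected values and a little over one per cent below the velocity of light, as the text states.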

Professor Michelson’s experiments on the velocity of light in carbon 
disulphide afford an interesting illustration of the difference between 
the velocity of waves and the velocity of groups of waves—a subject 
which is treated at length in an appendix to the second volume of 
Lord Rayleigh’s Theory of Sound. If we write V for the velocity
of waves, U for that of a group of waves, L for the wave-length, and
T for the period of vibration,

$$V = \frac{L}{T}, \qquad U = \frac{d\left(\frac{1}{T}\right)}{d\left(\frac{1}{L}\right)}.$$

For purposes of numerical calculation, it will be convenient to
transform these formulæ by the use of λ for the wave-length in


* Phil. Mag., (5), vol. vii, p. 277.
† Report Brit. Assoc., 1879, p. 285.
‡ Phil. Mag., (5), vol. x, p. 481.
§ Sitzungsberichte Wien. Akad., vol. lxxxvi, p. 106.
‖ Phil. Trans., vol. clxxiv, p. 707.
¶ Sitzungsberichte Wien. Akad., vol. lxxxix, p. 298.

vacuo, n for the index of refraction of the medium considered, and
k for the velocity of light in vacuo, which we shall regard as constant,
in accordance with general usage. By substitution of these letters we 
easily obtain 

$$\frac{k}{V} = n, \qquad \frac{k}{U} = \frac{d(n\lambda^{-1})}{d(\lambda^{-1})}.$$

The data for the calculation of these quantities for carbon disulphide 
are given by Verdet (Annales de Chimie et de Physique, (3), vol. lxix, 
p. 470). They give 


for the line D,   k/V = 1.624,   k/U = 1.722,
for the line E,   k/V = 1.637,   k/U = 1.767.


The quotient of the velocity in vacuo divided by the velocity in
carbon disulphide, according to Professor Michelson’s experiments
with the light of an arc lamp, is 1.76 ± .02, which agrees very well
with k/U. Another theory, which would make the velocity observed
in such experiments $V^2/U$ (Nature, vol. xxv, p. 52), receives no
countenance from these experiments. The value of $kU/V^2$ would
be about 1.53. Some may think that the experiments on water point
in a different direction. Taking our data from Beer’s Einleitung in
die höhere Optik, 1853, p. 411, we get

for D,   k/V = 1.334,   k/U = 1.352,   $kU/V^2$ = 1.316,
for E,   k/V = 1.336,   k/U = 1.359,   $kU/V^2$ = 1.313.

The number obtained by experiment was 1.330, which agrees better
with k/V, or even with $kU/V^2$, than with k/U, but the differences are
here too small to have much significance.
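
The third quantity in these comparisons follows from the other two by the identity kU/V² = (k/V)²/(k/U). The short sketch below, added as an illustration, recomputes it from the values of k/V and k/U quoted in the text; the function name is an invention for this example.

```python
def alternative_ratio(k_over_v, k_over_u):
    """kU/V^2 written as (k/V)^2 / (k/U)."""
    return k_over_v ** 2 / k_over_u

# (medium, line, k/V, k/U) as quoted in the text.
data = [
    ("carbon disulphide", "D", 1.624, 1.722),
    ("carbon disulphide", "E", 1.637, 1.767),
    ("water", "D", 1.334, 1.352),
    ("water", "E", 1.336, 1.359),
]

for medium, line, kv, ku in data:
    print(f"{medium:18s} {line}: kU/V^2 = {alternative_ratio(kv, ku):.3f}")
```

For carbon disulphide the three candidate quantities are well separated, which is why that experiment is decisive, while for water they differ only in the second decimal place.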


Theoretische Optik, gegründet auf das Bessel-Sellmeier’sche
Princip, zugleich mit den experimentellen Belegen.


Von Dr. E. KETTELER, Professor an der Universität in Bonn. Vieweg
und Sohn. Braunschweig, 1885.


The principle of Sellmeier, here referred to, relates to vibrations of 
ponderable particles excited by the etherial vibrations of light, and to 
the reaction of the former upon the latter. The name of Bessel is 
added on account of his previous solution of a somewhat analogous 
problem relating to the pendulum. The object of this work is “to 
treat theoretical optics in a complete and uniform manner on the new 
foundation of the simultaneous vibration of etherial and ponderable 
particles, and to substitute a consistent and systematic new structure 
for the present conglomerate of more or less disconnected principles.” 
Such a work demands a critical examination, which should not be 
undertaken from any narrow point of view. Any faults of detail will
be readily forgiven, if the author shall give the theory of optics
the ποῦ στῶ which it has sought so long in vain. We may add that
if this effort shall not be judged successful by the scientific world,
the author will at least have the satisfaction of being associated in his
failure with many of the most distinguished names in mathematical
physics.

We have sought to test the proposed theory with respect to that 
law of optics which seems most conspicuous in its definite mathe- 
matical form, and in the rigor of the experimental verifications to 
which it has been subjected, as well as in the magnificent develop- 
ments to which it has given rise: the law of double refraction due to 
Huyghens and Fresnel, and geometrically illustrated by the wave- 
surface of the latter. We cannot find that the law of Fresnel is 
proved at all in this treatise. We find on the contrary, that a law is 
deduced which is different from Fresnel’s, and inconsistent with it. 
We do not refer to anything relating to the direction of vibration of 
the rays in a crystal, which is a point not touched by the experi- 
mental verifications to which we have alluded. We shall confine 
our comparison to those equations from which the direction of 
vibration has been eliminated, and which therefore represent relations 
subject to experimental control. For this purpose equation (13) on 
page 299 is suitable. It reads 

$$\frac{u^2}{n_x^2 - n^2} + \frac{v^2}{n_y^2 - n^2} + \frac{w^2}{n_z^2 - n^2} = 0,$$

$n_x$, $n_y$, $n_z$ being the principal indices of refraction. This the author
calls the equation of the wave-surface or surface of ray-velocities. 
It has the form of the equation of Fresnel’s wave-surface, expressed 
in terms of the direction-cosines and reciprocal of the radius vector, 
and if u, v, w are the direction-cosines of the ray, and n the velocity
of light in vacuo divided by the so-called ray-velocity in the crystal,
the equation will express Fresnel’s law. But it is impossible to give
these meanings to u, v, w and n. They are introduced into the
discussion in the expression for the vibrations (p. 295), viz.,


$$\rho = A\cos 2\pi\left(\frac{t}{T} - \frac{n}{\lambda}(ux + vy + wz)\right).$$


The form of this equation shows that u, v, w are proportional to the
direction-cosines of the wave-normal, and as the relation $u^2 + v^2 + w^2 = 1$
is afterwards used, they must be the direction-cosines of the wave- 
normal. They cannot possibly denote the direction-cosines of the ray, 
except in the particular case in which the ray and wave-normal 
coincide. Again, from the form of this equation, λ/n must be the
wave-length in the crystal, and if λ here as elsewhere in the book
(see p. 25) denotes the wave-length in vacuo of light of the period
considered, which we doubt not is the intention of the author, n must
be the wave-length in vacuo divided by the wave-length in the
crystal, i.e., the velocity of light in vacuo divided by the wave-velocity
in the crystal. With these definitions of u, v, w, and n, equation (13)
expresses a law which is different from Fresnel’s. Applied to the 
simple case of a uniaxial crystal, it makes the relation between the 
wave-velocity of the extraordinary ray and the angle of the wave- 
normal with the principal axis the same as that of the radius vector 
and the angle in an ellipse. The law of Huyghens and Fresnel makes 
the reciprocal of the wave-velocity stand in this relation. 
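
The size of the difference between the two laws for a uniaxial crystal can be seen in a short calculation. The sketch below is an illustration only: it writes each law in terms of the index (the velocity in vacuo divided by the wave-velocity) as a function of the angle between the wave-normal and the principal axis, and it uses approximate indices for Iceland spar in sodium light, which are assumed values not taken from the review.

```python
import math

def index_fresnel(theta, n_o, n_e):
    """Huyghens and Fresnel: the reciprocal of the wave-velocity is the radius vector of an ellipse."""
    inv_sq = math.cos(theta) ** 2 / n_o ** 2 + math.sin(theta) ** 2 / n_e ** 2
    return 1.0 / math.sqrt(inv_sq)

def index_treatise(theta, n_o, n_e):
    """Law deduced in the treatise: the wave-velocity itself is the radius vector of an ellipse."""
    return math.sqrt(n_o ** 2 * math.cos(theta) ** 2 + n_e ** 2 * math.sin(theta) ** 2)

N_O, N_E = 1.658, 1.486  # assumed approximate indices of Iceland spar (sodium light)

for degrees in (0, 30, 45, 60, 90):
    theta = math.radians(degrees)
    f = index_fresnel(theta, N_O, N_E)
    t = index_treatise(theta, N_O, N_E)
    print(f"{degrees:2d} deg  Fresnel {f:.4f}  treatise {t:.4f}  difference {t - f:+.4f}")
```

The two laws agree along and perpendicular to the axis and differ at intermediate angles by an amount in the second or third decimal place of the index, which is well within the reach of refined measurement.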

The law which our author has deduced has come up again and 
again in the history of theoretical optics. Professor Stokes (Report 
of the British Assoc., 1862, part i, p. 269) and Lord Rayleigh (Phil. 
Mag., (4), vol. xli, p. 525) have both raised the question whether 
Huyghens and Fresnel might not have been wrong, and it might not 
be the wave-velocity and not its reciprocal which is represented by 
the radius vector in an ellipse. The difference is not very great, for 
if we lay off on the radii vectores of an ellipse distances inversely 
proportional to their lengths, the resultant figure will have an oval 
form approaching that of an ellipse when the eccentricity of the
original ellipse is small. Rankine appears to have thought that the
difference might be neglected (see Phil. Mag., (4), vol. i, pp. 444, 445);
at least he claims that his theory leads to Fresnel’s law, while really
it would give the same law which our author has found. (Concerning
Rankine’s “splendid failure,” and the whole history of the subject, see
Sir Wm. Thomson’s Lectures on Molecular Dynamics at the Johns 
Hopkins University, chap. xx.) Professor Stokes undertook experi- 
ments to decide the question. His result, corroborated by Glazebrook 
(Proc. Roy. Soc., vol. xx, p. 443; Phil. Trans., vol. clxxi, p. 421), was
that Huyghens and Fresnel were right and that the other law was 
wrong. 

To return to our author, we have no doubt from the context that 
he regards u, v, w, and n as relating to the ray and not to the wave-normal.
We suppose that that is the meaning of his remark that
the expression for the vibrations (quoted above) is to be referred to 
the direction of the ray. It seems rather hard not to allow a writer 
the privilege of defining his own terms. Yet the reader will admit 
that when the vibrations have been expressed in the above form an 
inexorable necessity fixes the significance of the direction determined 
by u, v, w, and leaves nothing in that respect to the choice of the
author. 

The historical sketches of the development of ideas in the theory 
of optics, enriched by very numerous references, will be useful to 
the student. An exception, however, must be made with respect to
the statements concerning the electromagnetic theory of light. We 
are told (p. 450) that the English theory, founded by Maxwell and 
represented by Glazebrook and Fitzgerald, makes the plane of polari- 
zation coincide with the plane of vibration, while Lorentz, on the 
basis of Helmholtz’s equations, comes to the conclusion that these
planes are at right angles. Since all these writers make the electrical 
displacement perpendicular to the plane of polarization, we can only 
attribute this statement to some confusion between the electrical 
displacement and the magnetic force or “displacement” at right angles 
to it. We are also told that Glazebrook’s “surface-conditions” which
determine the intensity of reflected and refracted light are different 
from those of Lorentz,—a singular error in view of the fact that 
Mr. Glazebrook (Proc. Camb. Phil. Soc., vol. iv, p. 166) expressly 
states that his results are the same as those of Lorentz, Fitzgerald, 
and J. J. Thomson. We have spent much fruitless labor in trying to 
discover where and how the expressions were obtained which are 
attributed to Glazebrook, but in which the notation has been altered. 
They ought to come from Glazebrook’s equations (24)-(27) (loc. cit.), 
but these appear identical with Lorentz’s equations (58)-(61) (Zeit- 
schrift f. Math. u. Phys., vol. xxii, p. 27). They might be obtained by 
interchanging the expressions for vibrations in the plane of incidence 
and at right angles to it, with two changes of sign. 

The reader must be especially cautioned concerning the statements
and implications of what has not been done in the electromagnetic
theory. These are such as to suggest the question whether the author
has taken the trouble to read the titles of the papers which have
been published. We refer especially to what is said on pages 248,
249 concerning absorption, dispersion, and the magnetic rotation of
the plane of polarization.

In the Experimental Part, with which the treatise closes, we have
a comparison of formulæ with the results of experiments by the
author and others. The author has been particularly successful in
the formula for dispersion. In the case of quartz (p. 545), the
formula (with four constants) represents the results of experiment in
a manner entirely satisfactory through the entire range of wave-length
from 2.14 to 0.214. Those who may not agree with the
author’s theoretical views will nevertheless be glad to see the results
of experiment brought together, and, so far as may be, represented by
formulæ.


XVII. 


ON THE VELOCITY OF LIGHT AS DETERMINED BY 
FOUCAULT'S REVOLVING MIRROR. 


[Nature, vol. xxxiii. p. 582, April 22, 1886.]


It has been shown by Lord Rayleigh and others that the velocity
($U$) with which a group of waves is propagated in any medium may
be calculated by the formula

$$U = V\left(1 - \frac{d\log V}{d\log\lambda}\right),$$

where $V$ is the wave-velocity, and $\lambda$ the wave-length. It has also
been observed by Lord Rayleigh that the fronts of the waves reflected
by the revolving mirror in Foucault’s experiment are inclined one to
another, and in consequence must rotate with an angular velocity

$$\alpha\,\frac{dV}{d\lambda},$$

where $\alpha$ is the angle between two successive wave-planes of similar
phase. When $dV/d\lambda$ is positive (the usual case), the direction of
rotation is such that the following wave-plane rotates towards the
position of the preceding (see Nature, vol. xxv. p. 52).

But I am not aware that attention has been called to the important
fact, that while the individual wave rotates the wave-normal of the
group remains unchanged, or, in other words, that if we fix our
attention on a point moving with the group, therefore with the
velocity $U$, the successive wave-planes, as they pass through that
point, have all the same orientation. This follows immediately from
the two formulæ quoted above. For the interval of time between
the arrival of two successive wave-planes of similar phase at the
moving point is evidently $\lambda/(V-U)$, which reduces by the first formula
to $d\lambda/dV$. In this time the second of the wave-planes, having
the angular velocity $\alpha\,dV/d\lambda$, will rotate through an angle $\alpha$ towards
the position of the first wave-plane. But $\alpha$ is the angle between the
two planes. The second plane, therefore, in passing the moving point,
will have exactly the same orientation which the first had. To get a
picture of the phenomenon, we may imagine that we are able to see
a few inches of the top of a moving carriage-wheel. The individual
spokes rotate, while the group maintains a vertical direction.
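
The argument of the preceding paragraph can be checked numerically. What follows is only a minimal sketch and no part of the original note: the dispersion law `wave_velocity`, the wave-length and the angle `alpha` are hypothetical values chosen for illustration, and the derivative of V with respect to the wave-length is estimated by a central difference.

```python
def wave_velocity(lam):
    """Hypothetical dispersion law V(lambda), chosen only for illustration."""
    return 1.0 + 0.3 * lam - 0.05 * lam ** 2


def dV_dlam(lam, h=1e-6):
    """Central-difference estimate of dV/d(lambda)."""
    return (wave_velocity(lam + h) - wave_velocity(lam - h)) / (2.0 * h)


def group_velocity(lam):
    """U = V (1 - d log V / d log lambda), i.e. V - lambda * dV/d(lambda)."""
    return wave_velocity(lam) - lam * dV_dlam(lam)


def rotation_between_arrivals(lam, alpha):
    """Angle turned by the second wave-plane before it reaches the moving point.

    The interval between arrivals is lambda / (V - U); the angular velocity
    of a wave-plane is alpha * dV/d(lambda).
    """
    interval = lam / (wave_velocity(lam) - group_velocity(lam))
    return alpha * dV_dlam(lam) * interval


lam, alpha = 0.8, 1e-4  # hypothetical wave-length and angle between planes
ratio = rotation_between_arrivals(lam, alpha) / alpha
print(ratio)  # a ratio of unity means the plane turns through exactly alpha
```

With any smooth dispersion law for which V and U differ, the ratio comes out as unity to within rounding, which is the statement that the second plane arrives with the orientation the first had.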

This consideration greatly simplifies the theory of Foucault’s
experiment, and makes it evident, I think, that the results of all 
such experiments depend upon the value of U, and not upon that of V. 

The discussion of the experiment by following a single wave, and 
taking account of its rotation, is a complicated process, and one in 
which it is very easy to leave out of account some of the elements of 
the problem. The principal objection to it, however, is its unreality. 
If the dispersion is considerable, no wave which leaves the revolving 
mirror will return to it. The individual disappears, only the group 
has permanence. Prof. Schuster, in his communication of March 11
(p. 439), has nevertheless obtained by this method, as the quantity
determined by “the experiments hitherto performed,” $V^2/(2V-U)$,
which, as he observes, is nearly equal to $U$. He would, I think,
have obtained $U$ precisely, if for the angle between two successive
wave-planes of similar phase, instead of $2\omega\lambda/V$, he had used the more
exact value $2\omega\lambda/U$.
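
That Schuster’s quantity is “nearly equal to U” may be made explicit by simple algebra: the square of V divided by (2V − U), less U, is equal to the square of (V − U) divided by (2V − U), so that the two differ only in the second order of V − U. The short sketch below illustrates this; the two velocities are hypothetical numbers, not values from any experiment.

```python
def schuster_quantity(V, U):
    """The quantity V^2 / (2V - U) obtained by following a single wave."""
    return V * V / (2.0 * V - U)


def excess_over_group_velocity(V, U):
    """Closed form of schuster_quantity(V, U) - U, namely (V - U)^2 / (2V - U)."""
    return (V - U) ** 2 / (2.0 * V - U)


V, U = 1.00, 0.94  # hypothetical wave- and group-velocities
difference = schuster_quantity(V, U) - U
print(difference, excess_over_group_velocity(V, U))
```

Since the excess is positive whenever V and U differ, Schuster’s expression always lies slightly above U.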

By the kindness of Prof. Michelson, I am informed with respect to 
his recent experiments on the velocity of light in bisulphide of carbon 
that he would be inclined to place the maximum brilliancy of the 
light between the spectral lines D and E, but nearer to D. If we
take the mean between D and E, we have

$$\frac{K}{U} = 1\cdot745, \qquad \frac{K(2V-U)}{V^2} = 1\cdot737,$$

K denoting the velocity in vacuo (see p. 249 of this volume). The
number observed was 1·76, “with an uncertainty of two units in the
second place of decimals.” This agrees best with the first formula.
The same would be true if we used values nearer to the line D.


J. WILLARD GIBBS, 
New Haven, Connecticut, April 1. [1886.] 


XVIII. 


VELOCITY OF PROPAGATION OF ELECTROSTATIC FORCE. 
[Nature, vol. LIII. p. 509, April 2, 1896.]


As we may have to wait some time for the experimental solution
of Lord Kelvin’s very instructive and suggestive problem concerning
two pairs of spheres charged with electricity (see Nature of February
6, p. 316), it may be interesting to see what the solution would be 
from the standpoint of existing electrical theories. 

In applying Maxwell’s theory to the problem it will be convenient 
to suppose the dimensions of both pairs of spheres very small in 
comparison with the unit of length, and the distance between the 
two pairs very great in comparison with the same unit. These 
conditions, which greatly simplify the equations which represent the 
phenomena, will hardly be regarded as affecting the essential nature 
of the question proposed. 

Let us first consider what would happen on the discharge of (A, B), 
if the system (c, d) were absent. 

Let $m_0$ be the initial value of the moment of the charge of the
system (A, B), (this term being used in a sense analogous to that
in which we speak of the moment of a magnet), and $m$ the value
of the moment at any instant. If we set

$$m = F(t), \qquad (1)$$

and suppose the discharge to commence when $t = 0$, and to be
completed when $t = h$, we shall have

$$F(t) = m_0 \quad \text{when } t < 0, \qquad (2)$$

and

$$F(t) = 0 \quad \text{when } t > h. \qquad (3)$$

Let us set the origin of coordinates at the centre of the system
(A, B), and the axis of $x$ in the direction of the centre of the
positively charged sphere. A unit vector in this direction we shall
call $i$, and the vector from the origin to the point considered $\rho$.
At any point outside of a sphere of unit radius about the origin,
the electrical displacement ($D$) is given by the vector equation

$$4\pi D = [3r^{-5}F(t-cr) + 3cr^{-4}F'(t-cr) + c^2r^{-3}F''(t-cr)]\,x\rho - [r^{-3}F(t-cr) + cr^{-2}F'(t-cr) + c^2r^{-1}F''(t-cr)]\,i, \qquad (4)$$

where $F$ denotes the function determined by equation (1), $F'$ and $F''$
its derivatives, and $c$ the ratio of electrostatic and electromagnetic
units of electricity, or the reciprocal of the velocity of light. For
this satisfies the general equation

$$-\nabla^2 D = c^2\, d^2D/dt^2, \qquad (5)$$

as well as the so-called “equation of continuity,” and also satisfies
the special conditions that when $t < 0$

$$4\pi D = m_0(3r^{-5}x\rho - r^{-3}i)$$

outside of the unit sphere, and that at any time at the surface of
this sphere

$$4\pi D = m(3x\rho - i),$$

if we consider the terms containing the factor $c$ as negligible, when
not compensated by large values of $r$. That equation (4) satisfies
the general conditions is easily verified, if we set

$$u = r^{-1}F(t-cr), \qquad (6)$$

and observe that

$$-\nabla^2 u = c^2\, d^2u/dt^2, \qquad (7)$$

and that the three components of $D$ are given by the equations

$$4\pi f = -d^2u/dy^2 - d^2u/dz^2, \qquad 4\pi g = d^2u/dx\,dy, \qquad 4\pi h = d^2u/dx\,dz. \qquad (8)$$

Equation (4) shows that the changes of the electrical displacement
are represented by three systems of spherical waves, of forms
determined by the rapidity of the discharge of the system (A, B),
which expand with the velocity of light with amplitudes diminishing
as $r^{-3}$, $r^{-2}$, and $r^{-1}$, respectively. Outside of these waves, the electrical
displacement is unchanged, inside of them it is zero.
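
Equations (4), (7) and (8) lend themselves to a numerical spot check. The sketch below is an illustration added under stated assumptions and no part of the original letter: the discharge law F (a Gaussian), the value of c and the test point are all hypothetical, and the derivatives are estimated by finite differences. It uses the ordinary Laplacian, taking the operator in (7) in Maxwell’s quaternion sense (the negative of the ordinary Laplacian), which is what the signs in (7) appear to require.

```python
import math

C = 0.5  # hypothetical value of c, the reciprocal of the velocity of the waves


def F(s):
    """Hypothetical smooth discharge law."""
    return math.exp(-s * s)


def F1(s):
    """First derivative of F."""
    return -2.0 * s * math.exp(-s * s)


def F2(s):
    """Second derivative of F."""
    return (4.0 * s * s - 2.0) * math.exp(-s * s)


def u(x, y, z, t):
    """Equation (6): u = F(t - c r) / r."""
    r = math.sqrt(x * x + y * y + z * z)
    return F(t - C * r) / r


def second_derivative(func, point, axis, h=1e-3):
    """Central second difference of func along one coordinate of point."""
    plus, minus = list(point), list(point)
    plus[axis] += h
    minus[axis] -= h
    return (func(*plus) - 2.0 * func(*point) + func(*minus)) / (h * h)


def four_pi_f_from_eq4(x, y, z, t):
    """x-component of the second member of equation (4)."""
    r = math.sqrt(x * x + y * y + z * z)
    s = t - C * r
    radial = 3.0 * r ** -5 * F(s) + 3.0 * C * r ** -4 * F1(s) + C * C * r ** -3 * F2(s)
    axial = r ** -3 * F(s) + C * r ** -2 * F1(s) + C * C * r ** -1 * F2(s)
    return radial * x * x - axial


P = (1.2, 0.7, -0.4, 0.3)  # hypothetical point (x, y, z) and time t
laplacian = sum(second_derivative(u, P, axis) for axis in range(3))
u_tt = second_derivative(u, P, 3)
four_pi_f_from_eq8 = -second_derivative(u, P, 1) - second_derivative(u, P, 2)

print(laplacian - C * C * u_tt)                    # residual of (7)
print(four_pi_f_from_eq8 - four_pi_f_from_eq4(*P))  # (8) against (4)
```

Both printed residuals should be small compared with the quantities themselves, being limited only by the finite-difference step.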

If we write (with Maxwell) $-d\mathfrak{A}/dt$ for the force of electrodynamic
induction at any point, and suppose its rectangular components
calculated from those of $-d^2D/dt^2$ by the formula used in calculating
the potential of a mass from its density, we shall have by Poisson’s
theorem

$$\nabla^2(d\mathfrak{A}/dt) = 4\pi\, d^2D/dt^2,$$

or by (5),

$$\nabla^2(d\mathfrak{A}/dt) = -4\pi c^{-2}\nabla^2 D,$$

whence

$$d\mathfrak{A}/dt = -4\pi c^{-2}D. \qquad (9)$$

From this, with (4), and the general equation

$$d\mathfrak{A}/dt + 4\pi c^{-2}D + \nabla V = 0,$$

we see that during the discharge of the system (A, B) the electrostatic
force $-\nabla V$ vanishes throughout all space, while its place is taken by
a precisely equal electrodynamic force $-d\mathfrak{A}/dt$.

This electrodynamic force remains unchanged at every point until
the passage of the waves, after which the electrostatic force, the
electrodynamic force, and the displacement, have the permanent value
zero.

If we write Curl for the differentiating vector operator which
Maxwell calls by that name, equations (8) may be put in the form

$$4\pi D = \operatorname{Curl}\,\operatorname{Curl}\,(iu),$$

whence

$$dD/dt = (4\pi)^{-1}\operatorname{Curl}\,\operatorname{Curl}\,(i\,du/dt).$$

From $dD/dt$ we may calculate the magnetic induction $\mathfrak{B}$ by an
operation which is the inverse of $(4\pi)^{-1}$ Curl. We have therefore

$$\mathfrak{B} = \operatorname{Curl}\,(i\,du/dt),$$

or

$$\mathfrak{B} = [r^{-3}F'(t-cr) + cr^{-2}F''(t-cr)](yk - zj).$$

The magnetic induction is therefore zero except in the waves.

Equations (4) and (9) give the value of $d\mathfrak{A}/dt$ as function of $t$ and
$r$. By integration, we may find the value of $\mathfrak{A}$, Maxwell’s “vector
potential.” This will be of the form of the second member of (4)
multiplied by $-c^{-2}$, if we should give each $F$ one accent less, and for
an unaccented $F$ should write $F_1$, to denote the primitive of $F$ which
vanishes for the argument 0.

That which seems most worthy of notice is that although simul- 
taneously with the discharge of the system (A, B) the values of what 
we call the electric potential, the electrodynamic force of induction, 
and the “vector potential,” are changed throughout all space, this does 
not appear connected with any physical change outside of the waves, 
which advance with the velocity of light. 

If we now suppose that there is a second pair of charged spheres 
(c, d), as in the original problem, the discharge of this pair will 
evidently occur when the relaxation of electrical displacement reaches 
it. The time between the discharges is, therefore, by Maxwell’s 
theory, the time required for light to pass from one pair to the other. 

It may also be interesting to observe that in the axis of $x$, on both
sides of the origin, $x\rho = r^2 i$, and equation (4) reduces to

$$4\pi D = [2r^{-3}F(t-cr) + 2cr^{-2}F'(t-cr)]\,i.$$

Here, therefore, the oscillations are normal to the wave-surfaces.
This might seem to imply that plane waves of normal oscillations
may be propagated, since we are accustomed to regard a part of an
infinite sphere as equivalent to a part of an infinite plane. Of course,
such a result would be contrary to Maxwell’s theory. The paradox
is explained if we consider that the parts of the wave-motion,
expressed by $F$ and $F'$, diminish more rapidly than those expressed by
$F''$, so that it is unsafe to take the displacements in the axis of $x$ as
approximately representing those at a moderate distance from it. In
fact, if we consider the displacements not merely in the axis of $x$, but
within a cylinder about that axis, and follow the waves to an infinite
distance from the origin, we find no approximation to what is usually
meant by plane waves with normal oscillations.

J. WILLARD GIBBS.
New Haven, Conn., March 12 [1896].


XIX. 


FOURIER’S SERIES. 
[Nature, vol. LIX, p. 200, Dec. 29, 1898.]


I should like to add a few words concerning the subject of Prof.
Michelson’s letter in Nature of October 6. In the only reply which
I have seen (Nature, October 13), the point of view of Prof. Michelson
is hardly considered.

Let us write $f_n(x)$ for the sum of the first $n$ terms of the series

$$\sin x - \tfrac{1}{2}\sin 2x + \tfrac{1}{3}\sin 3x - \tfrac{1}{4}\sin 4x + \text{etc.}$$

I suppose that there is no question concerning the form of the curve
defined by any equation of the form

$$y = 2f_n(x).$$

Let us call such a curve $C_n$. As $n$ increases without limit, the
curve approaches a limiting form, which may be thus described. Let
a point move from the origin in a straight line at an angle of 45°
with the axis of X to the point $(\pi, \pi)$, thence vertically in a straight
line to the point $(\pi, -\pi)$, thence obliquely in a straight line to the
point $(3\pi, \pi)$, etc. The broken line thus described (continued
indefinitely forwards and backwards) is the limiting form of the
curve as the number of terms increases indefinitely. That is, if any
small distance $d$ be first specified, a number $n'$ may be then specified,
such that for every value of $n$ greater than $n'$, the distance of any
point in $C_n$ from the broken line, and of any point in the broken line
from $C_n$, will be less than the specified distance $d$.

But this limiting line is not the same as that expressed by the
equation

$$y = \lim_{n=\infty} 2f_n(x).$$

The vertical portions of the broken line described above are
wanting in the locus expressed by this equation, except the points in
which they intersect the axis of X. The process indicated in the
last equation is virtually to consider the intersections of $C_n$ with fixed
vertical transversals, and seek the limiting positions when $n$ is
increased without limit. It is not surprising that this process does
not give the vertical portions of the limiting curve. If we should
consider the intersections of $C_n$ with horizontal transversals, and
seek the limits which they approach when $n$ is increased indefinitely,
we should obtain the vertical portions of the limiting curve as well as
the oblique portions.

It should be observed that if we take the equation

$$y = 2f_n(x),$$

and proceed to the limit for $n = \infty$, we do not necessarily get $y = 0$ for
$x = \pi$. We may get that ratio by first setting $x = \pi$, and then passing
to the limit. We may also get $y = 1$, $x = \pi$, by first setting $y = 1$, and
then passing to the limit. Now the limit represented by the equation
of the broken line described above is not a special or partial limit
relating solely to some special method of passing to the limit, but it
is the complete limit embracing all sets of values of $x$ and $y$ which
can be obtained by any process of passing to the limit.


J. WILLARD GIBBS. 
New Haven, Conn., November 29 [1898]. 


[Nature, vol. LIX, p. 606, April 27, 1899.]


I should like to correct a careless error which I made (Nature,
December 29, 1898) in describing the limiting form of the family of
curves represented by the equation

$$y = 2\left(\sin x - \tfrac{1}{2}\sin 2x \ldots \pm \tfrac{1}{n}\sin nx\right) \qquad (1)$$

as a zigzag line consisting of alternate inclined and vertical portions.
The inclined portions were correctly given, but the vertical portions,
which are bisected by the axis of X, extend beyond the points where
they meet the inclined portions, their total lengths being expressed
by four times the definite integral

$$\int_0^{\pi} \frac{\sin u}{u}\,du.$$

If we call this combination of inclined and vertical lines $C$, and
the graph of equation (1) $C_n$, and if any finite distance $d$ be specified,
and we take for $n$ any number greater than $100/d^2$, the distance of
every point in $C_n$ from $C$ is less than $d$, and the distance of every
point in $C$ from $C_n$ is also less than $d$. We may therefore call $C$ the
limit (or limiting form) of the sequence of curves of which $C_n$ is the
general designation.

But this limiting form of the graphs of the functions expressed
by the sum (1) is different from the graph of the function expressed by
the limit of that sum. In the latter the vertical portions are wanting,
except their middle points.
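
The distinction may be illustrated numerically. The following is a minimal sketch, not part of the original letter: it evaluates the sum (1) at the abscissa π − π/(n + 1), which is taken here (as an assumption of the sketch) for the last crest before x = π, and compares the ordinate with twice the definite integral above, the integral itself being estimated by Simpson’s rule. All function names are introduced only for this illustration.

```python
import math


def partial_sum(x, n):
    """y = 2 (sin x - sin 2x / 2 + ... +/- sin nx / n), the sum (1)."""
    return 2.0 * sum((-1) ** (k + 1) * math.sin(k * x) / k
                     for k in range(1, n + 1))


def sine_integral_to_pi(steps=2000):
    """Simpson's rule for the integral of sin(u)/u from 0 to pi (steps even)."""
    def integrand(v):
        return 1.0 if v == 0.0 else math.sin(v) / v

    h = math.pi / steps
    total = integrand(0.0) + integrand(math.pi)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0


def crest_near_pi(n):
    """Ordinate of the sum (1) at x = pi - pi/(n + 1)."""
    return partial_sum(math.pi - math.pi / (n + 1), n)


half_length = 2.0 * sine_integral_to_pi()   # half the total vertical length
for n in (10, 100, 1000):
    print(n, crest_near_pi(n), half_length, math.pi)
print(partial_sum(math.pi, 1000))            # ordinate exactly at x = pi
```

As n grows the crest ordinate approaches twice the integral, which exceeds π, while the ordinate at x = π itself remains zero: the former belongs to the limit of the graphs, the latter to the graph of the limit.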

I think this distinction important, for (with exception of what
relates to my unfortunate blunder described above) whatever differences
of opinion have been expressed on this subject seem due, for the
most part, to the fact that some writers have had in mind the limit
of the graphs, and others the graph of the limit of the sum. A misunderstanding
on this point is a natural consequence of the usage
which allows us to omit the word limit in certain connections, as
when we speak of the sum of an infinite series. In terms thus
abbreviated, either of the things which I have sought to distinguish
may be called the graph of the sum of the infinite series.


J. WILLARD GIBBS.
New Haven, April 12 [1899]. 


XX. 


RUDOLF JULIUS EMANUEL CLAUSIUS. 
[Proceedings of the American Academy, new series, vol. XVI, pp. 458-465, 1889.] 


RUDOLF JULIUS EMANUEL CLAUSIUS was born at Cöslin in Pomerania,
January 2, 1822. His studies, after 1840, were pursued at Berlin,
where he became Privat-docent in the University, and Instructor in
Physics in the School of Artillery. He was Professor of Physics
at Zürich in the Polytechnicum (1855-67) and in the University
(1857-67), at Würzburg (1867-69), and finally at Bonn (1869-88),
where he died on the 24th of August, 1888.

His literary activity commenced in 1847, with the publication of
a memoir in Crelle’s Journal, “Ueber die Lichtzerstreuung in der
Atmosphäre, und über die Intensität des durch die Atmosphäre
reflectirten Sonnenlichts.”* This was immediately followed by other
writings relating to the same subject, two of which were subsequently
translated from Poggendorff’s Annalen† for Taylor’s Scientific
Memoirs. A treatise entitled “Die Lichterscheinungen der Atmosphäre”
formed part of Grunert’s “Beiträge zur meteorologischen
Optik.”

An entirely different subject, the elasticity of solids, was discussed
in his paper (1849), “Ueber die Veränderungen, welche in den bisher
gebräuchlichen Formeln für das Gleichgewicht und die Bewegung fester
Körper durch neuere Beobachtungen nothwendig geworden sind.”‡

But it was with questions of quite another order of magnitude that 
his name was destined to be associated. The fundamental questions 
concerning the relation of heat to mechanical effect, which had been 
raised by Rumford, Carnot, and others, to meet with little response, 
were now everywhere pressing to the front. 

“For more than twelve years,” said Regnault in 1853, “I have 
been engaged in collecting the materials for the solution of this ques- 
tion :—Given a certain quantity of heat, what is, theoretically, the 
amount of mechanical effect which can be obtained by applying the 
heat to evaporation, or the expansion of elastic fluids, in the various 
circumstances which can be realised in practice?”§ The twenty-first 


* Vol. xxxiv, p. 122, and vol. xxxvi, p. 185. † Vol. lxxvi, pp. 161 and 188.
‡ Pogg. Ann., vol. lxxvi, p. 46 (1849). § Comptes Rendus, vol. xxxvi, p. 676.

volume of the Memoirs of the Academy of Paris, describing the first
part of the magnificent series of researches which the liberality of the 
French government enabled him to carry out for the solution of this 
question, was published in 1847. In the same year appeared Helm- 
holtz’s celebrated memoir, “Ueber die Erhaltung der Kraft.” For 
some years Joule had been making those experiments which were 
to associate his name with one of the fundamental laws of thermo- 
dynamics and one of the principal constants of nature. In 1849 he 
made that determination of the mechanical equivalent of heat by the 
stirring of water which for nearly thirty years remained the unques- 
tioned standard. In 1848 and 1849 Sir William Thomson was engaged 
in developing the consequences of Carnot’s theory of the motive 
power of heat, while Professor James Thomson in demonstrating the 
effect of pressure on the freezing point of water by a Carnot’s cycle, 
showed the flexibility and the fruitfulness of a mode of demonstration 
which was to become canonical in thermodynamics. Meantime 
Rankine was attacking the problem in his own way, with one of 
those marvellous creations of the imagination of which it is so 
difficult to estimate the precise value. 

Such was the state of the question when Clausius published his 
first memoir on thermodynamics: “Ueber die bewegende Kraft der
Wärme, und die Gesetze, welche sich daraus für die Wärmelehre selbst
ableiten lassen.”*

This memoir marks an epoch in the history of physics. If we say, 
in the words used by Maxwell some years ago, that thermodynamics 
is “a science with secure foundations, clear definitions, and distinct
boundaries,”† and ask when those foundations were laid, those definitions
fixed, and those boundaries traced, there can be but one answer.
Certainly not before the publication of that memoir. The materials 
indeed existed for such a science, as Clausius showed by constructing 
it from such materials, substantially, as had for years been the com- 
mon property of physicists. But truth and error were in a confusing 
state of mixture. Neither in France, nor in Germany, nor in Great 
Britain, can we find the answer to the question quoted from Regnault. 
The case was worse than this, for wrong answers were confidently 
urged by the highest authorities. That question was completely 
answered, on its theoretical side, in the memoir of Clausius, and the
science of thermodynamics came into existence. And as Maxwell said 
in 1878, so it might have been said at any time since the publication 
of that memoir, that the foundations of the science were secure, its 
definitions clear, and its boundaries distinct. 


* Read in the Berlin Academy, February 18, 1850, and published in the March and
April numbers of Poggendorff’s Annalen.
† Nature, vol. xvii, p. 257.

The constructive power thus exhibited, this ability to bring order
out of confusion, this breadth of view which could apprehend one 
truth without losing sight of another, this nice discrimination to 
separate truth from error,—these are qualities which place the 
possessor in the first rank of scientific men. 

In the development of the various consequences of the funda- 
mental propositions of thermodynamics, as applied to all kinds of 
physical phenomena, Clausius was rivalled, perhaps surpassed, in 
activity and versatility by Sir William Thomson. His attention, 
indeed, seems to have been less directed toward the development of 
the subject in extension, than toward the nature of the molecular 
phenomena of which the laws of thermodynamics are the sensible 
expression. He seems to have very early felt the conviction, that 
behind the second law of thermodynamics, which relates to the heat 
absorbed or given out by a body, and therefore capable of direct 
measurement, there was another law of similar form but relating to 
the quantities of heat (i.e., molecular vis viva) absorbed in the
performance of work, external or internal. 

This may be made more definite, if we express the second law in
a mathematical form, as may be done by saying that in any reversible
cyclic process which a body may undergo

$$\int \frac{dQ}{t} = 0,$$

where $dQ$ is an elementary portion of the heat imparted to the body,
and $t$ the absolute temperature of the body, or the portion of it which
receives the heat. Or, without limitation to cyclic processes, we
may say that for any reversible infinitesimal change,

$$dQ = t\,dS,$$

where $S$ denotes a certain function of the state of the body, called by
Clausius the entropy. The element of heat may evidently be divided
into two parts, of which one represents the increase of molecular
vis viva in the body, and the other the work done against forces,
either external or internal. If we call these parts $dH$ and $dQ_w$, we
have

$$dQ = dH + dQ_w.$$

Now the proposition of which Clausius felt so strong a conviction
was that for reversible cyclic processes

$$\int \frac{dQ_w}{t} = 0,$$

and that for any reversible infinitesimal change

$$dQ_w = t\,dZ,$$

where $Z$ is another function of the state of the body, which he
called the disgregation, and regarded as determined by the positions
of the elementary parts of the body without reference to their velocities.
In this respect it differed from the entropy. An immediate
consequence of these relations is that for any reversible cyclic process

$$\int \frac{dH}{t} = 0,$$

and therefore that $H$, the molecular vis viva of the body, must be
a function of the temperature alone. This important result was
expressed by Clausius in the following words: “Die Menge der in
einem Körper wirklich vorhandenen Wärme ist nur von seiner
Temperatur und nicht von der Anordnung seiner Bestandtheile
abhängig.” [The quantity of heat actually present in a body depends
only on its temperature, and not on the arrangement of its
constituent parts.]

To return to the equation

$$dQ_w = t\,dZ.$$


This expresses that heat tends to increase the disgregation, and that
the intensity of this tendency is proportional to the absolute temperature.
In the words of Clausius: “Die mechanische Arbeit, welche
die Wärme bei irgend einer Anordnungsänderung eines Körpers thun
kann, ist proportional der absoluten Temperatur, bei welcher die
Aenderung geschieht.” [The mechanical work which heat can do in
any change of arrangement of a body is proportional to the absolute
temperature at which the change takes place.]

Such in brief and in part were the views advanced by Clausius in 
1862, in his memoir, “Ueber die Anwendung des Satzes von der 
Aequivalenz der Verwandlungen auf die innere Arbeit.”* Although 
they were advanced rather as a hypothesis than as anything for 
which he could give a formal proof, he seems to have little doubt of 
their correctness, and his confidence seems to have increased with the 
course of time. 

The substantial correctness of these views cannot now be called in 
question. The researches especially of Maxwell and Boltzmann have 
shown that the molecular vis viva is proportional to the absolute 
temperature, and Boltzmann has even been able to determine the 
precise nature of the functions which Clausius called entropy and 
disgregation.† But the anticipation, to a certain extent, at so early a
period in the history of the subject, of the ultimate form which the 
theory was to take, shows a remarkable insight, which is by no 
means to be lightly esteemed on account of the acknowledged want of 
a rigorous demonstration. The propositions, indeed, as relating to 
quantities which escape direct measurement, belong to molecular 
science, and seem to require for their complete and satisfactory 
demonstration a considerable development of that science. This 


* Pogg. Ann., vol. cxvi, p. 73. See also vol. cxxvii, p. 477 (1866).
† Sitzungsberichte Wien. Akad., vol. lxiii, p. 728 (1871).

development naturally commenced with the simplest case involving
the characteristic problems of the subject,—the case, namely, of gases. 

The origin of the kinetic theory of gases is lost in remote antiquity, 
and its completion the most sanguine cannot hope to see. But a 
single generation has seen it advance from the stage of vague sur- 
mises to an extensive and well established body of doctrine. This is 
mainly the work of three men, Clausius, Maxwell, and Boltzmann, of 
whom Clausius was the earliest in the field, and has been called by 
Maxwell the principal founder of the science.* We may regard his 
paper (1857), “Ueber die Art der Bewegung, welche wir Wärme
nennen,”† as marking his definite entrance into this field, although
many points were incidentally discussed in earlier papers. 

This was soon followed by his papers, “Ueber die mittlere Länge
der Wege, welche bei der Molecularbewegung gasförmiger Körper
von den einzelnen Molecülen zurückgelegt werden,”‡ and “Ueber die
Wärmeleitung gasförmiger Körper.”§

A very valuable contribution to molecular science is the conception 
of the virial, defined in his paper (1870), “Ueber einen auf die Wärme
anwendbaren Satz,”|| where he shows that in any case of stationary
motion the mean vis viva of the system is equal to its virial. 

In the mean time, Maxwell and Boltzmann had entered the field. 
Maxwell’s first paper, “On the Motions and Collisions of perfectly 
elastic Spheres,”¶ was characterized by a new manner of proposing
the problems of molecular science. Clausius was concerned with the 
mean values of various quantities which vary enormously in the 
smallest time or space which we can appreciate. Maxwell occupied 
himself with the relative frequency of the various values which these 
quantities have. In this he was followed by Boltzmann. In reading 
Clausius, we seem to be reading mechanics; in reading Maxwell, and 
in much of Boltzmann’s most valuable work, we seem rather to be 
reading in the theory of probabilities. There is no doubt that the 
larger manner in which Maxwell and Boltzmann proposed the 
problems of molecular science enabled them in some cases to get 
a more satisfactory and complete answer, even for those questions 
which do not at first sight seem to require so broad a treatment. 

Boltzmann’s first work, however (1866), “Ueber die mechanische
Bedeutung des zweiten Hauptsatzes der Wärmetheorie,”** was in a
line in which no one had preceded him, although he was followed by 


* Nature, vol. xvii, p. 278.
† Pogg. Ann., vol. c, p. 353 (1857).
‡ Ibid., vol. cv, p. 239 (1858). See also Wied. Ann., vol. x, p. 92.
§ Ibid., vol. cxv, p. 1 (1862).
|| Ibid., vol. cxli, p. 124. See also Jubelband, p. 411.
¶ Phil. Mag., vol. xix, p. 19 (1860).
** Sitzungsberichte Wien. Akad., vol. liii, p. 195.

some of the most distinguished names among his contemporaries.
Somewhat later (1870) Clausius, whose attention had not been called 
to Boltzmann’s work, wrote his paper, “Ueber die Zurückführung des
zweiten Hauptsatzes der mechanischen Wärmetheorie auf allgemeine
mechanische Principien.”*

The point of departure of these investigations, and others to which 
they gave rise, is the consideration of the mean values of the force- 
function and of the vis viva of a system in which the motions are 
periodic, and of the variations of these mean values when the external 
influences are changed. The theorems developed belong to the same 
general category as the principle of least action, and the principle 
or principles known as Hamilton’s, which have to do, explicitly or 
implicitly, with the variations of these mean values. 

Among other papers of Clausius on this subject, we may mention 
the two following: “Ueber einen neuen mechanischen Satz in
Bezug auf stationäre Bewegung”† (1873), and “Ueber den Satz vom
mittleren Ergal und seine Anwendung auf die Molecularbewegungen
der Gase”‡ (1874).

The first problem of molecular science is to derive from the observed 
properties of bodies as accurate a notion as possible of their molecular 
constitution. The knowledge we may gain of their molecular con- 
stitution may then be utilized in the search for formulas to represent 
their observable properties. A most notable achievement in this 
direction is that of van der Waals, in his celebrated memoir, “On the 
Continuity of the Gaseous and Liquid States.” To this part of the 
subject belong the following papers of Clausius: “Ueber das Verhalten
der Kohlensäure in Bezug auf Druck, Volumen und Temperatur,”§
and “Ueber die theoretische Bestimmung des Dampfdruckes und der
Volumina des Dampfes und der Flüssigkeit” (two papers).||

Another matter in which Clausius showed his originality and 
power was the vexed subject of electrodynamics, as treated in his 
memoir, “Ueber die Ableitung eines neuen electrodynamischen
Grundgesetzes.”¶ Various points in the theory of electricity in
which the principles of thermodynamics or of molecular science 
were involved, had previously been treated in different papers, of 
which the earliest appeared in 1852,** while the doctrine of the 


* Pogg. Ann., vol. cxlii, p. 433.
† Ibid., vol. cl, p. 106.
‡ Ibid., Ergänzungsband vii, p. 215.
§ Wied. Ann., vol. ix, p. 337 (1880).
|| Ibid., vol. xiv, p. 279 and p. 692 (1881).
¶ Crelle’s Journal, vol. lxxxii, p. 85 (1877).

** “Ueber das mechanische Aequivalent einer electrischen Entladung und die dabei
stattfindende Erwärmung des Leitungsdrahtes.” Pogg. Ann., vol. lxxxvi, p. 337.
“Ueber die bei einem stationären electrischen Strome in dem Leiter gethane Arbeit

potential (electrical and gravitational) was treated in a separate book,
which appeared in 1859, with the title, “Die Potentialfunction und 
das Potential, ein Beitrag zur mathematischen Physik.” This 
subsequently went through several editions, in which it was revised 
and enlarged. All these subjects, with others, were brought together 
in a single volume, “Die mechanische Behandlung der Electricität,”
which appeared in 1879, forming the second volume of his
“Mechanische Wärmetheorie.”* Later papers on electricity related
to the principles of electrodynamics,† electrical and magnetic units,‡
and dynamo-electric machines.§

The Royal Society’s catalogue of scientific papers, and the excellent 
indices to the Annalen der Physik und Chemie, in which Clausius’s 
work usually appeared, render it unnecessary to enumerate in detail 
his scientific papers. The list, indeed, would be a long one. The 
Royal Society's catalogue gives seventy-seven titles for the years 
1847-1873. Subsequently twenty-five papers have appeared in the 
Annalen alone, and about half as many others elsewhere. 

But such work as that of Clausius is not measured by counting 
titles or pages. His true monument lies not on the shelves of 
libraries, but in the thoughts of men, and in the history of more 
than one science. 


und erzeugte Wärme.” Pogg. Ann., vol. lxxxvii, p. 415 (1852). “Ueber die Anwendung
der mechanischen Wärmetheorie auf die thermoelectrischen Erscheinungen.”
Pogg. Ann., vol. xc, p. 513 (1853). “Ueber die Electricitätsleitung in Electrolyten.”
Pogg. Ann., vol. ci, p. 338 (1857).

*The first volume of this work appeared in 1876, and contained the general theory 
with the more immediate consequences of the two fundamental laws. The third volume 
has not yet appeared, but it is expected very soon, edited by Professor Planck and 
Dr. Pulfrich. In a certain sense this work may be regarded as a second edition of an 
earlier one (1864 and 1867), which consisted of a reprint of papers and had the title 
“Abhandlungen über die mechanische Wärmetheorie.”

† Wied. Ann., vol. x, p. 608; vol. xi, p. 604.

‡ Ibid., vol. xvi, p. 529; vol. xvii, p. 713.

§ Ibid., vol. xx, p. 353; vol. xxi, p. 385.

HUBERT ANSON NEWTON.
[American Journal of Science, ser. 4, vol. iii, pp. 359-376, May 1897.]
(Read before the National Academy of Sciences, in April, 1897.) 


HUBERT ANSON NEWTON was born on March 19th, 1830, at
Sherburne, N.Y., and died at New Haven, Conn., on the 12th day of
August, 1896. He was the fifth son of a family of seven sons and
four daughters, children of William and Lois (Butler) Newton. The
parents traced their ancestry back to the first settlers of Massachusetts
and Connecticut,* and had migrated from the latter to
Sherburne, when many parts of central New York were still a
wilderness. They both belonged to families remarkable for longevity, 
and lived themselves to the ages of ninety-three and ninety-four 
years. Of the children, all the sons and two daughters were living as 
recently as the year 1889, the youngest being then fifty-three years 
of age. William Newton was a man of considerable enterprise, and 
undertook the construction of the Buffalo section of the Erie canal, 
as well as other work in canal and railroad construction in New 
York and Pennsylvania. In these constructions he is said to have 
relied on his native abilities to think out for himself the solution 
of problems which are generally a matter of technical training. His 
wife was remarkable for great strength of character united with a 
quiet temperament and well-balanced mind, and was noted among 
her neighbors for her mathematical powers. 

Young Newton, whose mental endowments were thus evidently 
inherited, and whose controlling tastes were manifested at a very 
early age, fitted for college at the schools of Sherburne, and at the 
age of sixteen entered Yale College in the class graduating in 1850. 
After graduation he pursued his mathematical studies at New Haven 
and at home, and became tutor at Yale in January, 1853, when on 
account of the sickness and death of Professor Stanley the whole 
charge of the mathematical department devolved on him from the first. 


* Richard Butler, the great-grandfather of Lois Butler, came over from England
before 1633, and was one of those who removed from Cambridge to Hartford. An
ancestor of William Newton came directly from England to the New Haven colony
about the middle of the same century.

In 1855, he was appointed professor of mathematics at the early
age of twenty-five. This appointment testifies to the confidence 
which was felt in his abilities, and is almost the only instance in 
which the Yale Corporation has conferred the dignity of a full 
professorship on so young a man. 

This appointment being accompanied with a leave of absence for a 
year, in order to give him the opportunity to study in Europe, it was 
but natural that he should be attracted to Paris, where Chasles was 
expounding at the Sorbonne that modern higher geometry of which 
he was to so large an extent the creator, and which appeals so 
strongly to the sense of the beautiful. And it was inevitable that 
the student should be profoundly impressed by the genius of his 
teacher and by the fruitfulness and elegance of the methods which he 
was introducing. The effect of this year’s study under the inspiring 
influence of such a master is seen in several contributions to the 
Mathematical Monthly during its brief existence in the years 1858-61. 
One of these was a problem which attracted at once the attention 
of Cayley, who sent a solution. Another was a discussion of the 
problem “to draw a circle tangent to three given circles,” remarkable 
for his use of the principle of inversion. A third was a very 
elaborate memoir on the construction of curves by the straight edge 
and compasses, and by the straight edge alone. These early essays in 
geometry show a mind thoroughly imbued with the spirit of modern 
geometry, skilful in the use of its methods, and eager to extend the 
bounds of our knowledge. 

Nevertheless, although for many years the higher geometry was 
with him a favorite subject of instruction for his more advanced 
students, either his own preferences, or perhaps rather the influence 
of his environment, was destined to lead him into a very different 
field of research. In the attention which has been paid to astronomy 
in this country we may recognize the history of the world repeating 
itself in a new country in respect to the order of the development of 
the sciences, or it may be enough to say that the questions which 
nature forces on us are likely to get more attention in a new country 
and a bustling age, than those which a reflective mind puts to itself, 
and that the love of abstract truth which prompts to the construction 
of a system of doctrine, and the refined taste which is a critic of 
methods of demonstration, are matters of slow growth. At all events, 
when Professor Newton was entering upon his professorship, the 
study of the higher geometry was less consonant with the spirit of 
the age in this country than the pursuit of astronomical knowledge, 
and the latter sphere of activity soon engrossed his best efforts. 

Yet it was not in any of the beaten paths of astronomers that 
Professor Newton was to move. It was rather in the wilds of a
terra incognita, which astronomers had hardly troubled themselves to
claim as belonging to their domain, that he first labored to establish 
law and order. It was doubtless not by chance that he turned his 
attention to the subject of shooting stars. The interest awakened in 
this country by the stupendous spectacle of 1833, which was not seen 
in Europe, had not died out. This was especially true at New Haven, 
where Mr. Edward C. Herrick was distinguished for his indefatigable
industry both in personal observation and in the search for records of 
former showers. A rich accumulation of material was thus awaiting 
development. In 1861, the Connecticut Academy of Arts and Sciences 
appointed a committee “to communicate with observers in various 
localities for combined and systematic observations upon the August 
and November meteors.” In this committee Professor Newton was 
preeminently active. He entered zealously upon the work of col- 
lecting material by personal observation and correspondence and by 
organizing corps of observers of students and others, and at the same 
time set himself to utilize the material thus obtained by the most 
careful study. The value of the observations collected was greatly 
increased by a map of the heavens for plotting meteor-paths, which 
was prepared by Professor Newton and printed at the expense of the 
Connecticut Academy for distribution among observers. 

By these organized efforts, in a great number of cases observations 
were obtained on the same meteor as seen from different places, and 
the actual path in the atmosphere was computed by Professor Newton. 
In a paper published in 1865* the vertical height of the beginning 
and the end of the visible part of the path is given for more than 
one hundred meteors observed on the nights of August 10th and 
November 13th, 1863. It was shown that the average height of the 
November meteors is fifteen or twenty miles higher than that of
the August meteors, the former beginning in the mean at a height
of ninety-six miles and ending at sixty-one, the latter beginning at
seventy and ending at fifty-six.

We mention this paper first, because it seems to represent the 
culmination of a line of activity into which Professor Newton had 
entered much earlier. We must go back to consider other papers 
which he had published in the meantime. 

His first papers on this subject, 1860-62,† were principally devoted
to the determination of the paths and velocities of certain brilliant 
meteors or fireballs, which had attracted the attention of observers in 
different localities. Three of these appeared to have velocities much 
greater than is possible for permanent members of the solar system. 
To another a particular interest attached as belonging to the August 


* Amer. Jour. Sci., ser. 2, vol. xl, p. 250.
† Amer. Jour. Sci., ser. 2, vol. xxx, p. 186; xxxii, p. 448; and xxxiii, p. 338.

shower, although exceptional in size. For this he calculated the
elements of the orbit which would give the observed path and velocity. 
But the determination of the velocity in such cases, which depends 
upon the estimation by the observers of the time of flight, is neces- 
sarily very uncertain, and at best affords only a lower limit for the 
value of the original velocity of the body before it encountered the 
resistance of the earth’s atmosphere. This would seem to constitute an 
insuperable difficulty in the determination of the orbits of meteoroids, 
to use the term which Professor Newton applied to these bodies, 
before they enter the earth’s atmosphere to appear for a moment as 
luminous meteors. Yet it has been completely overcome in the case 
of the November meteors, or Leonids as they are called from the 
constellation from which they appear to radiate. This achievement 
constitutes one of the most interesting chapters in the history of 
meteoric science, and gives the subject an honorable place among the
exact sciences. 

In the first place, by a careful study of the records, Professor 
Newton showed that the connection of early showers with those of 
1799 and 1833 had been masked by a progressive change in the time 
of the year in which the shower occurs. This change had amounted 
to a full month between A.D. 902, when the shower occurred on 
October 13, and 1833, when it occurred on November 13. It is in 
part due to the precession of the equinoxes, and in part to the motion 
of the node where the earth’s orbit meets that of the meteoroids. 
This motion must be attributed to the perturbations of the orbits of 
the meteoroids which are produced by the attractions of the planets, 
and being in the direction opposite to that of the equinoxes, Professor 
Newton inferred that the motion of the meteoroids must be retrograde. 

The showers do not, however, occur whenever the earth passes the 
node, but only when the passage occurs within a year or two before 
or after the termination of a cycle of 33.25 years. This number is
obtained by dividing the interval between the showers of 902 and 
1833 by 28, the number of cycles between these dates, and must 
therefore be a very close approximation. For if these showers did 
not mark the precise end of cycles the resultant error would be 
divided by 28. Professor Newton showed that this value of the cycle 
requires that the number of revolutions performed by the meteoroids 
in one year should be either $2 \pm \frac{1}{33.25}$, or $1 \pm \frac{1}{33.25}$, or $\frac{1}{33.25}$. In other words,
the periodic time of the meteoroids must be either 180.0 or 185.4 or
354.6 or 376.6 days, or 33.25 years. Now the velocity of any body
in the solar system has a simple relation to its periodic time and its 
distance from the sun. Assuming, therefore, any one of these five 
values of the periodic time, we have the velocities of the Leonids 
at the node very sharply determined. From this velocity, with the
position of the apparent radiant, which gives the direction of the
relative motion, and with the knowledge that the heliocentric motion
is retrograde, we may easily determine the orbit. 
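
The arithmetic of the preceding paragraph can be retraced in a few lines. The sketch below is only an illustration, not Professor Newton's own computation: the function name and the year length of 365.25 days are assumptions introduced here. With that assumed year it reproduces the five periods quoted in the text to within about a tenth of a day.

```python
# Illustrative check of the five admissible Leonid periods.
# Assumption (not from the text): a year of 365.25 days.
DAYS_PER_YEAR = 365.25

# Interval between the showers of A.D. 902 and 1833, divided by 28 cycles.
cycle_years = (1833 - 902) / 28


def candidate_periods(cycle):
    """Return the five periods (in years) compatible with a shower cycle.

    The swarm meets the earth at the node once per cycle if it performs
    2 +/- 1/cycle, 1 +/- 1/cycle, or 1/cycle revolutions in one year.
    """
    inv = 1.0 / cycle
    revolutions_per_year = [2 + inv, 2 - inv, 1 + inv, 1 - inv, inv]
    return [1.0 / n for n in revolutions_per_year]


periods_years = candidate_periods(cycle_years)
# The first four periods expressed in days; the fifth is left in years.
short_periods_days = [p * DAYS_PER_YEAR for p in periods_years[:4]]

if __name__ == "__main__":
    print("cycle (years):", cycle_years)
    for days in short_periods_days:
        print("period (days): %.1f" % days)
    print("period (years): %.2f" % periods_years[4])
```

The small residual difference in the shortest period depends on the year length adopted, which the text does not state.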

We have, therefore, five orbits from which to choose. The 
calculation of the secular motion of the node due to the disturbing 
action of the planets, would enable us to decide between these orbits. 

Such are the most important conclusions which Professor Newton 
derived from the study of these remarkable showers, interesting not 
only from the magnificence of the spectacle occasionally exhibited, 
but in a much higher degree from the peculiarity in the periodic 
character of their occurrence, which affords the means of the deter- 
mination of the orbit of the meteoroids with a precision which would 
at first sight appear impossible. 

Professor Newton anticipated a notable return of the shower in 
1866, with some precursors in the years immediately preceding, a 
prediction which was amply verified. In the meantime he turned 
his attention to the properties which belong to shooting stars in 
general, and especially to those average values which relate to large 
numbers of these bodies not belonging to any particular swarm. 

This kind of investigation Maxwell has called statistical, and has 
in more than one passage signalized its difficulties. The writer 
recollects a passage of Maxwell which was pointed out to him by 
Professor Newton, in which the author says that serious errors have 
been made in such inquiries by men whose competency in other 
branches of mathematics was unquestioned. Doubtless Professor 
Newton was very conscious of the necessity of caution in these 
inquiries, as is indeed abundantly evident from the manner in which 
he expressed his conclusions; but the writer is not aware of any 
passage in which he has afforded an illustration of Maxwell’s remark. 

The results of these investigations appeared in an elaborate memoir 
“On Shooting Stars,” which was read to the National Academy in 
1864, and appeared two years later in the Memoirs of the Academy.* 
An abstract was given in the American Journal of Science in 1865.†


The following are some of the subjects treated, with some of the more 
interesting results : 


The distribution of the apparent paths of shooting stars in azimuth 
and altitude. 

The vertical distribution of the luminous part of the real paths. 
The value found for the mean height of the middle point of the 
luminous path was a trifle less than sixty miles. 

The mean length of apparent paths. 

The mean distance of paths from the observer. 


* Vol. i, 3d memoir. † Series 2, vol. xxxix, p. 193.

The mean foreshortening of paths.

The mean length of the visible part of the real paths. 

The mean time of flight as estimated by observers. 

The distribution of the orbits of meteoroids in the solar system. 

The daily number of shooting stars, and the density of the 
meteoroids in the space which the earth traverses.

The average number of shooting stars which enter the atmosphere 
daily, and which are large enough to be visible to the naked eye, 
if the sun, moon and clouds would permit it, is more than seven 
and a half millions. Certain observations with instruments seem 
to indicate that this number should be increased to more than 
four hundred millions, to include telescopic shooting stars, and there 
is no reason to doubt that an increase of optical power beyond that 
employed in these observations would reveal still larger numbers of 
these small bodies. In each volume of the size of the earth, of the 
space which the earth is traversing in its orbit about the sun, there 
are as many as thirteen thousand small bodies, each of which is 
such as would furnish a shooting star visible under favorable 
circumstances to the naked eye. 

These conclusions are certainly of a startling character, but not of 
greater interest than those relating to the velocity of meteoroids. 
There are two velocities to be considered, which are evidently con- 
nected, the velocity relative to the earth, and the velocity of the 
meteoroids in the solar system. To the latter, great interest attaches 
from the fact that it determines the nature of the orbit of the 
meteoroid. A velocity equal to that of the earth, indicates an orbit 
like that of the earth; a velocity √2 times as great, a parabolic
orbit like that of most comets, while a velocity greater than this 
indicates a hyperbolic orbit. 
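
The relation between velocity and the nature of the orbit may be made concrete as follows. This is a minimal sketch under stated assumptions: the body is taken at the earth's distance from the sun, the earth's orbit is treated as circular, and the standard vis-viva relation v² = GM(2/r − 1/a) is used, which the text does not itself write out. The function name is hypothetical.

```python
import math


def orbit_from_speed_ratio(ratio, tol=1e-9):
    """Classify an orbit from the speed at 1 AU, in units of the earth's speed.

    With r = 1 AU and the earth's orbit assumed circular, the vis-viva
    relation reduces to ratio**2 = 2 - 1/a, where a is the semi-major axis
    in astronomical units.
    Returns (kind, a); a is None for a parabola and negative for a hyperbola.
    """
    inverse_a = 2.0 - ratio ** 2
    if abs(inverse_a) <= tol:
        return ("parabolic", None)
    a = 1.0 / inverse_a
    if a > 0:
        return ("elliptic", a)
    return ("hyperbolic", a)


if __name__ == "__main__":
    for ratio in (1.0, 1.2, math.sqrt(2.0), 1.5):
        print(round(ratio, 4), orbit_from_speed_ratio(ratio))
```

A ratio of unity returns an ellipse with a semi-major axis of one astronomical unit, an orbit like that of the earth, and the critical ratio √2 separates the elliptic from the hyperbolic case.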

Professor Newton sought to form an estimate of this critical 
quantity in more than one way. That on which he placed most 
reliance was based on a comparison of the numbers of shooting stars 
seen in the different hours of the night. It is evident that in the 
morning, when we are in front of the earth in its motion about the 
sun, we should see more shooting stars than in the evening, when we 
are behind the earth; but the greater the velocity of the meteoroids 
compared with that of the earth, the less the difference would be in 
the numbers of evening and morning stars.* 


*It may not be out of place to notice here an erratum which occurs both in the 
Memoirs of the National Academy and in the abstract in the American Journal of 
Science, and which the writer finds marked in a private copy of Professor Newton’s. 
In the table on p. 20 of the memoir and 206 of the abstract, the column of numbers 
under the head “hour of the night” should be inverted. There is another displacement
in the table in the memoir, which is, however, corrected in the abstract.

After a careful discussion of the evidence Professor Newton reached
the conclusion that “we must regard as almost certain (on the
hypothesis of an equable distribution of the directions of absolute 
motions), that the mean velocity of the meteoroids exceeds con- 
siderably that of the earth; that the orbits are not approximately 
circular, but resemble more the orbits of comets.” 

This last sentence, which is taken from the abstract published in 
the American Journal of Science in 1865, and is a little more 
definitely and positively expressed than the corresponding passage 
in the original memoir, indicating apparently that the author’s 
conviction had been growing more positive in the interval, or at 
least that the importance of the conclusion had been growing upon 
him, embodies what is perhaps the most important result of the 
memoir, and derives a curious significance from the discoveries which 
were to astonish astronomers in the immediate future. 

The return of the November or Leonid shower in 1865, and 
especially in 1866, when the display was very brilliant in Europe,
gave an immense stimulus to meteoric study, and an especial 
prominence to this group of meteoroids. “Not since the year 
1759,” says Schiaparelli, “when the predicted return of a comet first 
took place, had the verified prediction of a periodic phenomenon 
made a greater impression than the magnificent spectacle of 
November, 1866. The study of cosmic meteors thereby gained the 
dignity of a science, and took finally an honorable place among 
the other branches of astronomy.”* Professor J. C. Adams, of
Cambridge, England, then took up the calculation of the perturba- 
tions determining the motion of the node. We have seen that 
Professor Newton had shown that the periodic time was limited to 
five sharply determined values, each of which with the other data 
would give an orbit, and that the true orbit could be distinguished 
from the other four by the calculation of the secular motion of the 
node. 

Professor Adams first calculated the motion of the node due to the 
attractions of Jupiter, Venus, and the Earth for the orbit having a 
period of 354.6 days. This amounted to a little less than 12′ in
33.25 years. As Professor Newton had shown that the dates of the
showers require a motion of 29′ in 33.25 years, the period of 354.6
days must be rejected. The case would be nearly the same with a
period of 376.6 days, while a period of 180 or 185.4 days would give
a still smaller motion of the node. Hence, of the five possible periods 
indicated by Professor Newton, four were shown to be entirely 
incompatible with the motion of the node, and it only remained to 


* Schiaparelli, Entwurf einer astronomischen Theorie der Sternschnuppen, p. 55.

examine whether the fifth period, viz. that of 33.25 years, would give
a motion of the node in accordance with the observed value. As 
this period gives a very long ellipse for the orbit, extending a little 
beyond the orbit of Uranus, it was necessary to take account of the 
perturbations due to that planet and to Saturn. Professor Adams
found 28′ for the motion of the node. As this value must be regarded
as sensibly identical with Professor Newton’s 29′ of observed motion,
no doubt was left in regard to the period of revolution or the orbit of 
the meteoroids.* 
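
That a period of 33.25 years implies an ellipse reaching a little beyond the orbit of Uranus may be verified roughly from Kepler's third law. The sketch below is illustrative only. It assumes that the orbit meets the earth's orbit at about one astronomical unit, so that the perihelion distance cannot exceed that amount, and it takes 19.19 astronomical units as a rounded modern value for the mean distance of Uranus; neither figure is taken from the text, and the function names are hypothetical.

```python
# Assumption (not from the text): rounded modern mean distance of Uranus.
URANUS_MEAN_DISTANCE_AU = 19.19


def semi_major_axis_au(period_years):
    """Kepler's third law in solar units: a**3 = P**2 (a in AU, P in years)."""
    return period_years ** (2.0 / 3.0)


def minimum_aphelion_au(period_years, node_distance_au=1.0):
    """Lower bound on the aphelion distance of an orbit that reaches the node.

    Since the perihelion distance q cannot exceed the distance of the node,
    the aphelion distance 2a - q is at least 2a - node_distance_au.
    """
    return 2.0 * semi_major_axis_au(period_years) - node_distance_au


if __name__ == "__main__":
    a = semi_major_axis_au(33.25)
    print("semi-major axis (AU): %.2f" % a)
    print("aphelion at least (AU): %.2f" % minimum_aphelion_au(33.25))
    print("Uranus mean distance (AU):", URANUS_MEAN_DISTANCE_AU)
```

On these assumptions the aphelion lies somewhat outside the mean distance of Uranus, which is why the perturbations of that planet and of Saturn had to be included.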

About this time, M. Schiaparelli was led by a course of reasoning 
similar to Professor Newton’s to the same conclusion,—that the mean 
velocity of the meteoroids is not very different from that due to 
parabolic orbits. In the course of his speculations in regard to the 
manner in which such bodies might enter the solar system, the 
questions suggested themselves: whether meteoroids and comets may 
not have a similar origin; whether, in case a swarm of meteoroids 
should include a body of sufficient size, this would not appear as a 
comet; and whether some of the known comets may not belong to 
streams of meteoroids. Calculating the orbit of the Perseids, or 
August meteoroids, from the radiant point, with the assumption of a 
nearly parabolic velocity, he found an orbit very similar to that of 
the great comet of 1862, which may therefore be considered as one 
of the Perseids, probably the largest of them all.†

At that time no known cometic orbit agreed with that of the 
Leonids, but a few months later, as soon as the definitive elements of 
the orbit of the first comet of 1866 were published, their resemblance 
to those of the Leonids, as calculated for the period of 33.25 years,
which had been proved to be the correct value, was strikingly 
manifested, attracting at once the notice of several astronomers. 

Other relations of the same kind have been discovered later, of 
which that of Biela’s comet and the Andromeds is the most interesting, 
as we have seen the comet breaking up under the influence of the 
sun; but in no case is the coincidence so striking as in that of 
the Leonids, since in no other case is the orbit of the meteoroids 
completely known, independently of that of the comet, and without 
any arbitrary assumption in regard to their periodic time. 

The first comet of 1866 is probably not the only one belonging to 
the Leonid stream of meteoroids. Professor Newton has remarked 
that the Chinese annals mention two comets which passed rapidly in 
succession across the sky in 1366, a few days after the passage of the 
earth through the node of the Leonid stream, which was marked in 
Europe by one of the most remarkable star-showers on record. The 


* Monthly Notices Roy. Ast. Soc., vol. xxvii, p. 247. † Entwurf, etc., pp. 49-54.

course of these comets, as described by the annalists, was in the line
of the Leonid stream.*

This identification of comets with meteors or shooting-stars marks 
an epoch in the study of the latter. Henceforth, they must be 
studied in connection with comets. It was presumably this discovery 
which led Professor Newton to those statistical investigations 
respecting comets, which we shall presently consider. At this point, 
however, at the close as it were of the first chapter in the history of 
meteoric science, it seems not unfitting to quote the words of an 
eminent foreign astronomer, written about this time, in regard to 
Professor Newton’s contributions to this subject. In an elaborate 
memoir in the Comptes Rendus, M. Faye says, with reference to our 
knowledge of shooting-stars and their orbits, “we may find in the 
works of M. Newton, of the United States, the most advanced 
expression of the state of science on this subject, and even the germ, 
I think, of the very remarkable ideas brought forward in these last 
days by M. Schiaparelli and M. Le Verrier.”†

The first fruit of Professor Newton’s statistical studies on comets 
appeared in 1878 in a paper “On the Origin of Comets.” In this 
paper he considers the distribution in the solar system of the known 
cometic orbits, and compares it with what we might expect on either 
of two hypotheses: that of Kant, that the comets were formed in the 
evolution of the solar system from the more distant portion of the 
solar nebula; and that of Laplace, that the comets have come from the 
stellar spaces and in their origin had no relation to the solar system. 

In regard to the distribution of the aphelia, he shows that, except 
so far as modified by the perturbations due to the planets, the theory 
of internal origin would require all the aphelia to be in the vicinity 
of the ecliptic; the theory of external origin would make all directions
of the aphelia equally probable, i.e. the distribution in latitude
of the aphelia should be that in which the frequency is as the cosine 
of the latitude. The actual distribution comes very near to this, but 
as the effect of perturbations would tend to equalize the distribution 
of aphelia in all directions, Professor Newton does not regard this 
argument as entirely decisive. He remarks, however, that if Kant’s 
hypothesis be true, the comets must have been revolving in their 
orbits a very long time, and the process of the disintegration of comets 
must be very slow. 
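
The cosine law for the latitudes of the aphelia follows from the geometry of the sphere: a zone of given width in latitude has an area proportional to the cosine of the latitude. A minimal sketch, with hypothetical function names and no data from the paper, is given below.

```python
import math


def latitude_density(lat_deg):
    """Probability density per radian of latitude for isotropic directions.

    Defined on latitudes from -90 to +90 degrees; proportional to the
    cosine of the latitude and normalized to unit total probability.
    """
    return 0.5 * math.cos(math.radians(lat_deg))


def fraction_within_latitude(b_deg):
    """Fraction of isotropic directions lying within b degrees of the ecliptic.

    Integrating the density from -b to +b gives sin(b).
    """
    return math.sin(math.radians(b_deg))


if __name__ == "__main__":
    for b in (10, 30, 60, 90):
        print(b, round(fraction_within_latitude(b), 4))
```

Thus, if all directions were equally probable, one-half of the aphelia would lie within 30° of the ecliptic, whereas the theory of internal origin would place nearly all of them there.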

In regard to the distribution of the orbits in inclination, the author 
shows that the theory of internal origin would make all inclinations 
equally probable; the theory of external origin would make all 


* Amer. Jour. Sci., ser. 2, vol. xliii, p. 298, and vol. xlv, p. 91, or Encycl. Britann.,
article Meteor.

† Comptes Rendus, t. lxiv, p. 551.

directions of the normal to the plane of the orbit equally probable.
On the first hypothesis, therefore, we should expect a uniform
distribution in inclination; on the second, a frequency proportioned 
to the sine of the inclination. It was shown by a diagram in which 
the actual and the two theoretical distributions are represented 
graphically, that the actual distribution agrees pretty well with 
the theory of external origin and not at all with that of internal 
origin. It was also shown that the curve of actual distribution 
cannot be made to agree with Kant’s hypothesis by any simple 
and reasonable allowances for perturbations. On the other hand, 
if we assume the external origin of comets, and ask how the curve 
of sines must be modified in order to take account of perturbations, 
it is shown that the principal effect will be to increase somewhat 
the number of inclinations between 90° and 135° at the expense of 
those between 45° and 90°. It is apparent at once from the diagram 
that such a change would make a very good agreement between the 
actual and theoretical curves, the only important difference remaining 
being due to comets of short periods, which mostly have small 
inclinations with direct motion. These should not weigh very much, 
the author observes, in the general question of the distribution of 
inclinations, because they return so frequently and are so easily 
detected that their number in a list of observed comets is out of all 
proportion to their number among existing comets. But this group 
of comets of short periods can easily be explained on the theory of an 
external origin. For such comets must have lost a large part of 
their velocity by the influence of a planet. This is only likely to 
happen when a comet overtakes the planet and passes in front of it. 
This implies that its original motion was direct and in an orbit of 
small inclination to that of the planets, and although it may lose a 
large part of its velocity, its motion will generally remain direct and 
in a plane of small inclination. This very interesting case of the 
comets of short periods and small inclinations, which was treated 
rather briefly in this paper, was discussed more fully by Professor 
Newton at the meeting of the British Association in the following 
year.* 
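
The two theoretical distributions in inclination discussed above may be tabulated for intervals of 45°. The sketch below is an illustration only; it contains none of the observed numbers of the paper, and the function names are hypothetical. It gives the share of orbits expected in each interval under a uniform distribution (internal origin) and under the law of sines (external origin).

```python
import math


def uniform_fraction(i1_deg, i2_deg):
    """Share of orbits between two inclinations if all are equally probable."""
    return (i2_deg - i1_deg) / 180.0


def sine_law_fraction(i1_deg, i2_deg):
    """Share of orbits between two inclinations under the law of sines.

    The density is 0.5 * sin(i) on 0..180 degrees, so the integral over an
    interval is half the difference of the cosines of its limits.
    """
    return 0.5 * (math.cos(math.radians(i1_deg)) - math.cos(math.radians(i2_deg)))


BINS = [(0, 45), (45, 90), (90, 135), (135, 180)]
table = [(lo, hi, uniform_fraction(lo, hi), sine_law_fraction(lo, hi))
         for lo, hi in BINS]

if __name__ == "__main__":
    for lo, hi, uniform, sine in table:
        print("%3d-%3d deg  uniform %.3f  sine law %.3f" % (lo, hi, uniform, sine))
```

Under the law of sines the intervals from 45° to 90° and from 90° to 135° are equally populated; the perturbations described in the text would transfer some orbits from the former interval to the latter.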

Many years later, Professor Newton returned to the same general 
subject in a very interesting memoir “On the Capture of Comets by 
Planets; especially their Capture by Jupiter,” which was read before 
the National Academy in 1891, and appeared in the Memoirs of the 
Academy two years later.† It also appeared in the American
Journal of Science in the year in which it was read.‡ This contains


* Rep’t Brit. Assoc. Adv. Sci. for 1879, p. 272.
† Mem. Nat. Acad., vol. vi, 1st memoir.
‡ Amer. Jour. Sci., ser. 3, vol. xlii, pp. 183 and 482.

the results of careful statistical calculations on the effect of perturbations
on orbits of comets originally parabolic. It corroborates
the more general statements of the paper “On the Origin of Comets,” 
giving them a precise quantitative form. One or two quotations 
will give some idea of the nature of this very elaborate and curious 
memoir, in which, however, the results are largely presented in the 
form of diagrams. 

On a certain hypothesis regarding an original equable distribution 
of comets in parabolic orbits about the sun, it is shown that “if in 
a given period of time a thousand million comets come in parabolic 
orbits nearer to the sun than Jupiter, 126 of them will have their 
orbits changed” by the action of that planet “into ellipses with 
periodic times less than one-half that of Jupiter; 839 of them will 
have their orbits changed into ellipses with periodic times less than 
that of Jupiter; 1701 of them will have their orbits changed into 
ellipses with periodic times less than once and a half that of Jupiter, 
and 2670 of them will have their orbits changed into ellipses with 
periodic times less than twice that of Jupiter.” A little later, Pro- 
fessor Newton considers the question, which he characterizes as 
perhaps more important, of the direct or retrograde motion of the 
comets after such perturbations. It is shown that of the 839 comets 
which have periodic times less than Jupiter, 203 will have retrograde 
motions, and 636 will have direct motions. Of the 203 with retro- 
grade motion, and of the 636 with direct motion, 51 and 257, respec- 
tively, will have orbits inclined less than 30° to that of Jupiter. 
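
The figures just quoted are more easily compared when reduced to proportions. The following sketch merely restates the numbers given above and computes ratios from them; the variable names are hypothetical, and nothing is added to the data of the memoir.

```python
# Counts quoted in the text, per thousand million comets passing in
# parabolic orbits nearer to the sun than Jupiter.
TOTAL_COMETS = 1_000_000_000

# Upper limit of periodic time (in periods of Jupiter) -> number captured.
captured_within = {0.5: 126, 1.0: 839, 1.5: 1701, 2.0: 2670}

retrograde, direct = 203, 636          # of the 839 with period below Jupiter's
low_incl_retrograde, low_incl_direct = 51, 257  # inclined less than 30 degrees

# The retrograde and direct counts must account for all 839 comets.
assert retrograde + direct == captured_within[1.0]

capture_probability = captured_within[1.0] / TOTAL_COMETS
retrograde_share = retrograde / captured_within[1.0]
low_incl_share_retrograde = low_incl_retrograde / retrograde
low_incl_share_direct = low_incl_direct / direct

if __name__ == "__main__":
    print("probability of capture below Jupiter's period: %.2e" % capture_probability)
    print("share retrograde: %.3f" % retrograde_share)
    print("share under 30 deg, retrograde: %.3f" % low_incl_share_retrograde)
    print("share under 30 deg, direct: %.3f" % low_incl_share_direct)
```

About one captured comet in four is retrograde, and small inclinations are considerably more frequent among the direct orbits than among the retrograde.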

We have seen that the earliest of Professor Newton’s more important 
studies on meteors related to the Leonids, which at that time far 
surpassed all other meteoric streams in interest. One of his later 
studies related to another stream which in the mean time had acquired 
great importance. The identification of the orbit of the Andromed 
meteors with that of Biela’s comet, which we have already mentioned, 
gave these bodies a unique interest, as the comet had been seen to 
break up under the influence of the sun. Here the evolution of 
meteoroids was taking place before our eyes; and this interest was 
heightened by the showers of 1872 and 1885, which in Europe seem 
to have been unsurpassed in brilliancy by any which have occurred 
in this century. 

The phenomena of each of these showers were carefully discussed 
by Professor Newton. Among the principal results of his paper on 
the latter shower are the following:*

The time of the maximum frequency of meteors was Nov. 27, 1885 
6 h. 15 m. Gr. m. t. The estimated number per hour visible at one 


* Amer. Jour. Sci., ser. 3, vol. xxxi, p. 409.






place was then 75,000. This gives a density of the meteoroids in 
space represented by one to a cube of twenty miles edge. Three
hours later the frequency had fallen to one-tenth of the maximum 
value. The really dense portion of the stream through which we 
passed was less than 100,000 miles in thickness, and nearly all would 
be included in a thickness of 200,000 miles. 

A formula is given to express the effect of the earth’s attraction 
on the approaching meteoroids in altering the position of the radiant. 
This is technically known as the zenithal attraction, and is quite 
important in the case of these meteors on account of their small 
relative velocity. The significance of the formula may be roughly 
expressed by saying that the earth’s attraction changes the radiant of 
the Biela meteors, toward the vertical of the observer, one-tenth of 
the observed zenith distance of the radiant, or more briefly, that the 
zenithal attraction for these meteors is one-tenth of the observed 
zenith distance. The radiant even after the correction for zenithal 
attraction, and another for the rotation of the earth on its axis, is not 
a point but an area of several degrees diameter. The same has been 
observed in regard to other showers, but the result comes out more 
distinctly in the present case because the meteors were so numerous 
and the shower so carefully observed. 
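
The rule of thumb stated above for the zenithal attraction of the Biela meteors can be put into a line of arithmetic. The sketch below is an editorial illustration in Python; the function name and the sample zenith distance are hypothetical, and it encodes only the one-tenth approximation quoted in the text, not the formula of Professor Newton's paper, which is not reproduced here.

```python
def corrected_zenith_distance(observed_deg, fraction=0.1):
    """Zenith distance of the radiant with the zenithal attraction removed.

    The earth's attraction displaces the radiant toward the zenith by about
    `fraction` of the observed zenith distance, so the displacement is added
    back. The default of one-tenth is the rough figure the text gives for
    the Biela meteors only; it is not a general constant.
    """
    if not 0.0 <= observed_deg <= 180.0:
        raise ValueError("zenith distance must lie between 0 and 180 degrees")
    zenithal_attraction = fraction * observed_deg
    return observed_deg + zenithal_attraction


# Hypothetical example: a radiant observed 40 degrees from the zenith.
print(corrected_zenith_distance(40.0))
```

The correction grows in proportion to the zenith distance, so it vanishes for a radiant in the zenith and is greatest for one near the horizon.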

This implies a want of parallelism in the paths of the meteors, and 
it is a very important question whether it exists before the meteoroids 
enter our atmosphere, or whether it is due to the action of the 
atmosphere. 

Professor Newton shows that it is difficult to account for so large 
a difference in the original motions of the meteoroids, and thinks it 
reasonable to attribute a large part of the want of parallelism to the 
action of the atmosphere on bodies of an irregular form, such as we 
have every reason to believe that the meteoroids have, when they 
enter our atmosphere. The effect of the heat generated will be to 
round off the edges and prominent parts, and to reduce the meteor 
to a form more and more spherical. It is, therefore, quite natural 
that the greater portion of the curvature of the paths should be 
in the invisible portion and thus escape our notice. It is only in 
exceptional cases that the visible path is notably curved. 

But the great interest of the paper centers in his discussion of the 
relation of this shower to preceding showers, and to the orbit of 
Biela’s comet. The changes in the date of the shower (from Dec. 6 
to Nov. 27) and in the position of the radiant are shown to be related 
to the great perturbations of Biela’s comet in 1794, 1831, and 1841-2. 
The showers observed by Brandes, Dec. 6th, 1798, by Herrick, Dec. 7th,
1838, and by Heis, Dec. 8th and 10th, 1847, are related to the orbit 
of Biela’s comet as it was in 1772; while the great showers of 1872 
and 1885, as well as a trifling display in 1867, are related to the
orbit of 1852.*

Assuming, then, that the meteoroids which we met on the 27th of 
November, 1872, did not leave the immediate neighborhood of the 
Biela comet before 1841-2, we seem to have the data for a very 
precise determination of their orbit between those dates. The same 
is true of those which we met in 1885. The computation of these 
orbits, the author remarks, may possibly give evidence for or against 
the existence of a resisting medium in the solar system. 

In his last public utterance on the subject of meteors, which was 
on the occasion of the recent sesquicentennial celebration of the 
American Philosophical Society, Professor Newton returns to the Biela 
meteoroids, and finds in the scattering which they show in the plane 
of their orbit the proof of a disturbing force in that plane, and there- 
fore not due to the planets. The force exerted by the sun appears to 
be modified somewhat as we see it in the comets' tails, where indeed
the attraction is changed into a repulsion. Something of the same 
sort on a smaller scale relatively to the mass of the bodies appears to 
modify the sun’s action on the meteoroids. 

In 1888 Professor Newton read a paper before the National 
Academy “Upon the relation which the former Orbits of those 
Meteorites that are in our collections, and that were seen to fall, 
had to the Earth’s Orbit.” This was based upon a very careful study 
of more than 116 cases for which we have statements indicating more 
or less definitely the direction of the path through the air, as well as 
94 cases in which we only know the time of the fall. The results are 
expressed in the following three propositions: 

1. The meteorites which we have in our cabinets and which were 
seen to fall were originally (as a class, and with a very small 
number of exceptions) moving about the sun in orbits that had 
inclinations less than 90°; that is, their motions were direct, not 
retrograde. 

2. The reason why we have only this class of stones in our collec- 
tions is not one wholly or even mainly dependent on the habits of 
men; nor on the times when men are out of doors; nor on the places 
where men live; nor on any other principle of selection acting at or 
after the arrival of the stones at the ground. Either the stones 
which are moving in the solar system across the earth’s orbit move 
in general in direct orbits; or else for some reason the stones which


* It is a curious coincidence that the original discoverer of the December shower as 
a periodic phenomenon, Mr. Edward C. Herrick, should have been (with a companion, 
Mr. Francis Bradley,) the first to observe that breaking up of the parent body which 
was destined to reinforce the meteoric stream in so remarkable a manner. See Amer. 
Jour. Sci., ser. 3, vol. xxxi, pp. 85 and 88. 
move in retrograde orbits do not in general come through the air to
the ground in solid form. 

3. The perihelion distances of nearly all the orbits in which these 
stones moved were not less than 0.5 nor more than 1.0, the earth’s
radius vector being unity. 

Professor Newton adds, that it seems a natural and proper corollary 
to these propositions (unless it shall appear that stones meeting the 
earth are destroyed in the air) that the larger meteorites moving in 
our solar system are allied much more closely with the group of 
comets of short period than with comets whose orbits are nearly 
parabolic. All the known comets of shorter periods than 33 years 
move about the sun in direct orbits that have moderate inclinations 
to the ecliptic. On the contrary, of the nearly parabolic orbits that 
are known only a small proportion of the whole number have small 
inclinations with direct motion. 


We have briefly mentioned those papers which seem to constitute 
the most important contributions to the science of meteors and comets. 
To fully appreciate Professor Newton’s activity in this field, it would 
be necessary to take account of his minor contributions.* 

Most interesting and instructive to the general reader are his 
utterances on occasions when he has given a résumé of our knowledge 
on these subjects or some branch of them, as in the address “On the 
Meteorites, the Meteors, and the Shooting Stars,” which he delivered 
in 1886 as retiring president of the American Association for the 
Advancement of Science, or in certain lectures in the public courses 
of the Sheffield Scientific School of Yale University, entitled “The 
story of Biela’s Comet” (1874), “The relation of Meteorites to 
Comets” (1876), “The Worship of Meteorites” (1889), or in the 
articles on Meteors in the Encyclopedia Britannica and Johnson’s 
Cyclopedia. 

If we ask what traits of mind and character are indicated by 
these papers, the answer is not difficult. Professor Klein has divided 
mathematical minds into three leading classes: the logicians, whose 
pleasure and power lies in subtility of definition and dialectic skill;
the geometers, whose power lies in the use of the space-intuitions;
and the formalists, who seek to find an algorithm for every operation.†
Professor Newton evidently belonged to the second of these classes, 
and his natural tastes seem to have found an equal gratification 
in the development of a system of abstract geometric truths, or 


* These were detailed in a bibliography annexed to this paper in Amer. Jour. Sci., 
ser. 4, vol. iii.
† Lectures on Mathematics (Evanston), p. 2.
in the investigation of the concrete phenomena of nature as they
exist in space and time.

But these papers show more than the type of mind of the author;
they give no uncertain testimony concerning the character of the 
man. In all these papers we see a love of honest work, an aversion 
to shams, a distrust of rash generalizations and speculations based 
on uncertain premises. He was never anxious to add one more 
guess on doubtful matters in the hope of hitting the truth, or what 
might pass as such for a time, but was always willing to take 
infinite pains in the most careful test of every theory. To these 
qualities was joined a modesty which forbade the pushing of his 
own claims, and desired no reputation except the unsought tribute 
of competent judges. At the close of his article on meteors in the 
Encyclopedia Britannica, which has not the least reference to himself 
as a contributor to the science, he remarks that “meteoric science 
is a structure built stone by stone by many builders.” We may 
add that no one has done more than himself to establish the 
foundations of the science, and that the stones which he has laid 
are not likely to need relaying. 

The value of Professor Newton’s work has been recognized by 
learned societies and institutions both at home and abroad. He 
received the honorary degree of Doctor of Laws from the University 
of Michigan in 1868. He was president of the section of Mathematics 
and Astronomy in the American Association for the Advancement 
of Science in 1875, and president of the Association in 1885. On
the first occasion he delivered an address entitled “A plea for the 
study of pure Mathematics”; on the second the address on Meteorites, 
etc., which we have already mentioned. Of the American Mathematical
Society he was vice-president at the time of his death.

In 1888 the J. Lawrence Smith gold medal was awarded to him
by the National Academy for his investigations on the orbits of 
meteoroids. We may quote a sentence or two from his reply to 
the address of presentation, so characteristic are they of the man 
that uttered them: “To discover some new truth in nature,” he 
said, “even though it concerns the small things in the world, gives 
one of the purest pleasures in human experience. It gives joy to 
tell others of the treasure found.” 

Besides the various learned societies in our own country of which 
he was a member, including the American Academy of Arts and 
Sciences from 1862, the National Academy of Sciences from its 
foundation in 1863, the American Philosophical Society from 1867, 
he was elected in 1872 Associate of the Royal Astronomical Society 
of London, in 1886 Foreign Fellow of the Royal Society of Edinburgh, 
and in 1892 Foreign Member of the Royal Society of London. 
But the studies which have won for their author an honorable
reputation among men of science of all countries, form only one 
side of the life of the man whom we are considering. Another 
side, probably the most important, is that in which he was identified 
with the organic life of the College and University with which 
he had been connected from a very early age. In fact, we might 
almost call the studies which we have been considering, the recreations 
of a busy life of one whose serious occupation has been that of an 
instructor. If from all those who have come under his instruction 
we should seek to learn their personal recollections of Professor 
Newton, we should probably find that the most universal impression 
made on his students was his enthusiastic love of the subject which 
he was teaching. 

A department of the University in which he took an especial 
interest was the Observatory. This was placed under his direction 
at its organization, and although he subsequently resigned the 
nominal directorship, the institution remained virtually under his 
charge, and may be said to owe its existence in large measure 
to his untiring efforts and personal sacrifice in its behalf. 

One sphere of activity in the Observatory was suggested by a 
happy accident which Professor Newton has described in the American 
Journal of Science, September, 1893. An amateur astronomer in 
a neighboring town, Mr. John Lewis, accidentally obtained on a 
stellar photograph the track of a large meteor. He announced in 
the newspapers that he had secured such a photograph, and requested 
observations from those who had seen its flight. The photographic 
plate and the letters received from various observers were placed in 
Professor Newton's hands, and were discussed in the paper mentioned. 
The advantages of photographic observations were so conspicuous 
that Professor Newton was anxious that the Observatory should 
employ this method of securing the tracks of meteors. With the 
aid of an appropriation granted by the National Academy from the 
income of the J. Lawrence Smith fund, a battery of cameras was 
mounted on an equatorial axis. By this means, a number of meteor- 
tracks have been obtained of the August meteors, and in one case, 
through a simultaneous observation by Mr. Lewis in Ansonia, Professor 
Newton was able to calculate the course of the meteor in the atmo- 
sphere with a probable error which he estimated at less than a mile. 
The results which may be expected at the now near return of the 
Leonids, will be of especial interest, but it will be for others to utilize 
them. 

Professor Newton was much interested in the collection of meteorites, 
and the fine collection of stones and irons in the Peabody Museum 
of Yale University owes much to his efforts in this direction.

Professor Newton was a member of the American Metrological
Society from the first, and was conspicuously active in the agitation 
which resulted in the enactment of the law of 1866, legalizing the 
use of the metric system. He prepared the table of the metric 
equivalents of the customary units of weights and measures which 
was incorporated in the act, and by which the relations of 
the fundamental units were defined. But he did not stop here. 
Appreciating the weakness of legislative enactment compared with 
popular sentiment, and feeling that the real battle was to be won 
in familiarizing the people with the metric system, he took pains to 
interest the makers of scales and rulers and other devices for measure- 
ment in adopting the units and graduations of the metric system, 
and to have the proper tables introduced into school arithmetics. 

He was also an active member of the Connecticut Academy of 
Arts and Sciences, serving several years both as secretary and 
president,—also as member of the council. He was associate editor 
of the American Journal of Science from 1864, having especial
charge of the department of astronomy. His notes on observations
of meteors and on the progress of meteoric science, often very brief,
sometimes more extended, but always well considered, were especially
valuable.

In spite of his studious tastes and love of a quiet life, he did not 
shirk the duties of citizenship, serving a term as alderman in the 
city council, being elected, we may observe, in a ward of politics 
strongly opposed to his own.

Professor Newton married, April 14th, 1859, Anna C., daughter 
of the Rev. Joseph C. Stiles, D.D., of Georgia, at one time pastor 
of the Mercer Street Presbyterian Church in New York City, and
subsequently of the South Church in New Haven. She survived
her husband but three months, leaving two daughters.


In all these relations of life, the subject of this sketch exhibited
the same traits of character which are seen in his published papers,
the same modesty, the same conscientiousness, the same devotion 
to high ideals. His life was the quiet life of the scholar, ennobled 
by the unselfish aims of the Christian gentleman; his memory will 
be cherished by many friends; and so long as astronomers, while 
they watch the return of the Leonids marking off the passage of 
the centuries, shall care to turn the earlier pages of this branch 
of astronomy, his name will have an honorable place in the history of 
the science.